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NOVEL PHOSPHODIESTERASE INTERACTING PROTEINS 
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invention. 

INTRODUCTION 

Field of the Invention 

The field of the invention is cyclic nucleotide phosphodiesterases, particularly cAMP 
phosphodiesterases. 
Background of the Tnventtnn 

Cyclic nucleotide phosphodiesterases are a class of enzymes that catalyze the 
hydrolysis of phosphodiester bonds in cyclic nucleotides, e.g. cAMP. Cyclic nucleotides are 
important second messengers that regulate and mediate a number of cellular responses to 
extracellular signals, such as hormones, light and neurotransmitters. Since cyclic nucleotide 
phosphodiesterases modulate the concentration of cyclic nucleotides, these enzymes play a 
significant role in signal transductioa There are at least ten different classes of cyclic 
phosphodiesterases, seven of which are: (I) Ca(2+)/calmodulin-dependent PDEs; (II) cGMP- 
stimulated PDEs; (EI) cGMP-inhibited PDEs; (IV) cAMP-specific PDEs; (V) cGMP-specific 
PDEs; (VI) photoreceptor PDEs; and (VTJ) high-affinity, cAMP-specific PDEs. Because of 
their role in signal transduction, cyclic nucleotide phosphodiesterases have been pursued as 
therapeutic or pharmacologic targets in the modulation of a variety of distinct physiological 
processes. 

cAMP phosphodiesterase inhibitors hold great promise as therapeutic agents for use in 
the treatment of inflammation. Specifically, data indicates that these types of inhibitors are as 
effective, or even more effective, than adrenal steroids in suppressing most functions of 
inflammat ory cells, including: migration, adhesion and secretion of cytokines. Specific cAMP 
phosphodiesterase inhibitors that have been studied include: rolipram, theophylline, and the 
like. In addition, research is ongoing to identify new cAMP phosphodiesterase inhibitors. 
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Despite their promise as anti-inflammatory therapeutic agents, cAMP- 
phosphodiesterase inhibitors identified to date have demonstrated significant toxic side effects 
that have limited to their generalized use in the treatment of inflammation. 

As such, there is continued interest in the identification of new, more selective cAMP 
5 phosphodiesterase inhibitors for potential use as anti-inflammatory therapeutic agents. These 
efforts have employed recombinant phosphodiesterases for automated screening of candidate 
agents. Use of recombinant phosphodiesterases in screening applications has, however, been 
problematic as such recombinant enzymes have altered conformation as compared to their 
naturally occ urring counterparts, which affects the interaction with potential inhibitors and 
1 0 thereby confounds the results that are obtained. As such, the screening results obtained by 
using such recombinant proteins are problematic. 

Therefore, there is much interest in the further elucidation of the conformation of 
phosphodiesterases and other factors that may modulate the interaction of these enzymes with 
inhibitors. 
15 Relevant Literati™ 

The role of cAMP phosphodiesterases in inflammatory processes is reviewed in 
Torphy, Am. J. Respir. Crit. Care Med. (1998) 157:35 1-370. See also Houslay et ai, Adv. 
Pharmacol (1998) 44: 225-342 and Spina et al., Adv. Pharmacol (1998) 44: 33-89, as well as 
U.S. Patent No. 5,798,373, the disclosure of which is herein incorporated by reference. 

20 

SUMMARY OF THF. TNVFTvrnQN 
Nucleic acid compositions encoding phosphodiesterase interacting proteins, e.g. 
myomegalin, as well as the polypeptide compositions encoded thereby, are provided. Also 
provided are complexes of the subject phosphodiesterase interacting protein with a 
25 phosphodiesterase enzyme. The subject polypeptide and nucleic acid compositions, as well as 
complexes thereof, find use in a variety of applications, including research, diagnostic, and 
therapeutic agent identification and screening applications, as well as in therapeutic 
applications. 

30 BRIEF DESCRIPTION OF THE FTfflmffS 

Figure 1 provides the amino acid sequence of rat myomegalin. 
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Figure 2 provides the cDNA sequence of a clone having an open reading frame 
encoding the myomegalin protein having the amino acid sequence of Figure 1. 

Figure 3 provides the nucleic acid sequence from the first met to the first stop codon 
in the sequence of Figure 2. 

Figure 4 provides the nucleic acid sequence of human myomegalin. 

Figure 5 provides the amino acid sequence of human myomegalin. 

Figure 6 provides the amino acid sequence of rat M14 protein. 

DETAILED DESCRIPTION OF THE TNVF.NTTON 
Novel phosphodiesterase interacting proteins, particularly myomegalin, as well as 
nucleic acid compositions encoding the same, are provided. Also provided are complexes of 
the subject proteins and phosphodiesterases. The subject polypeptide and nucleic acid 
compositions find use in a variety of applications, including research, diagnostic, and 
therapeutic agent identification and screening applications, as well as in therapeutic 
applications. 

Before the subject invention is described further, it is to be understood that the 
invention is not limited to the particular embodiments of the invention described below, as 
variations of the particular embodiments may be made and still fall within the scope of the 
appended claims. It is also to be understood that the terminology employed is for the purpose 
of describing particular embodiments, and is not intended to be limiting. Instead, the scope of 
the present invention will be established by the appended claims. 

In this specification and the appended claims, the singular forms "a," "an," and "the" 
include plural reference unless the context clearly dictates otherwise. Unless defined 
otherwise, all technical and scientific terms used herein have the same meaning as commonly 
understood to one of ordinary skill in the art to which this invention belongs. 

Nucleic acid Compositions 

Nucleic acid compositions encoding phosphodiesterase (PDE) interacting proteins, as 
well as fragments thereof are provided. The subject nucleic acid compositions encode 
proteins that interact with a phoshodiesterase enzyme, modulate its conformation and direct 
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its location in a cell. In other words, the proteins encoded by the subject nucleic acid 
compositions are those that target a (PDE) to a particular subcellular compartment and alter 
the function and/or properties of the PDE. Of particular interest are nucleic acid compositions 
which encode proteins that bind to a PDE IV isoenzyme, including PDE4A, PDE4B, PDE4C 
PDE4D, and the like. 

By nucleic acid composition is meant a composition comprising a sequence of DNA 
having an open reading frame that encodes a PDE interacting polypeptide, i.e. a gene 
encoding a polypeptide that interacts with a PDE (e.g. binds to and targets a PDE), and is 
capable, under appropriate conditions, of being expressed as a PDE interacting polypeptide. 
Also encompassed in this term are nucleic acids that are homologous, substantially similar or 
identical to the nucleic acids encoding PDE interacting polypeptides or proteins. Thus, the 
subject invention provides genes encoding mammalian PDE interacting proteins, such as 
genes encoding human PDE interacting polypeptides and homologs thereof, as well as non- 
human mammalian PDE interacting polypeptides and homologs thereof, e.g. rat and mouse 
proteins. 

Of particular interest is a nucleic acid composition encoding a myomegalin protein, 
particularly a mammalian myomegalin protein, described in greater detail infra, or a fragment 
or homolog thereof. Specific nucleic acid compositions of interest include: polynucleotides 
encoding a rat myomegalin protein, such as polynucleotides having a nucleotide sequence 
found in SEQ ED NOs: 1 or 3, including polynucleotides in which the entire sequence is the 
same as the sequence of SEQ ID NOs. 1 or 3; and polynucleotides encoding human 
myomegalin protein, such as polynucleotides having a nucleotide sequence found in SEQ ID 
NO:04, including polynucleotides in which the entire sequence is the same as the sequence of 
SEQ ID NOs. 04, as well as those in which the entire sequence is the same as the sequence of 
an ORF found in SEQ ID NO:04. 

Also of interest are nucleic acid compositions encoding an Ml 4 polypeptide, 
described in greater detail infra, or a fragment or homolog thereof. Specific nucleic acid 
compositions of interest include polynucleotides encoding a rat M14 polypeptide, such as 
polynucleotides encoding an M14 polypeptide having the amino acid sequence set forth in 
SEQ ID NO:08. Polynucleotides encoding M14 homologs, and polynucleotides encoding 
PDE-interacting fragments of an M14 polypeptide, are also of interest. 
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Also of interest are nucleic acid compositions encoding a huntingtin-interacting 
protein, e.g., HTP1. Specific nucleic acid compositions of interest include a polynucleotide 
encoding a human HIP1 polypeptide, including, for example, a polynucleotide as disclosed in 
GenBank Accession No. U79734. 

The source of homologous genes to those specifically listed above may be any 
mamma l i an species, e.g., primate species, particularly human; rodents, such as guinea pigs 
and mice, canines, felines, bovines, ovines, equines, yeast, nematodes, etc. Between 
mammal i an species, e.g., human and mouse, homologs have substantial sequence similarity, 
e.g. at least 75% sequence identity, usually at least 90%, more usually at least 95% between 
nucleotide sequences. Sequence similarity is calculated based on a reference sequence, which 
may be a subset of a larger sequence, such as a conserved motif, coding region, flanking 
region, etc. A reference sequence will usually be at least about 18 nt long, more usually at 
least about 30 nt long, and may extend to the complete sequence that is being compared. 
Algorithms for sequence analysis are known in the art, such as BLAST, described in Altschul 
etal (1990), J. Mol Biol 215:403-10. Unless stated otherwise herein, all sequence identity 
figures provided in this application are determined using the BLAST program at default 
settings (e.g. w=4; T=\l). The sequences provided herein are essential for recognizing genes 
encoding PDE interacting protein-related and homologous polynucleotides in database 
searches. 

Nucleic acids encoding the subject PDE interacting proteins and polypeptides of the 
subject invention may be cDNAs or genomic DNAs, as well as fragments thereof. Also 
provided are genes comprising the subject nucleic acid compositions, where the term "gene" 
shall be intended to mean the open reading frame encoding specific PDE interacting proteins 
and polypeptides, and introns, as well as adjacent 5 ' and 3 ' non-coding nucleotide sequences 
involved in the regulation of expression, up to about 20 kb beyond the coding region, but 
possibly further in either direction. The gene may be introduced into an appropriate vector for 
extrachromosomal maintenance or for integration into a host genome. 

The term "cDNA" as used herein is intended to include all nucleic acids that share the 
arrangement of sequence elements found in native mature mRNA species, where sequence 
elements are exons and 3' and 5' non-coding regions. Normally mRNA species have 
contiguous exons, with the intervening introns, when present, being removed by nuclear RNA 
splicing, to create a continuous open reading frame encoding an PDE interacting protein. 
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A genomic sequence of interest comprises the nucleic acid present between the 
initiation codon and the stop codon, as defined in the listed sequences, including all of the 
introns that are normally present in a native chromosome. It may further include the 3 ' and 5' 
untranslated regions found in the mature mRNA. It may further include specific 
5 transcriptional and translation^ regulatory sequences, such as promoters, enhancers, etc., 
including about 1 kb, but possibly more, of flanking genomic DNA at either the 5 ' or 3 ' end 
of the transcribed region. The genomic DNA may be isolated as a fragment of 100 kbp or 
smaller; and substantially free of flanking chromosomal sequence. The genomic DNA flanking 
the coding region, either 3' or 5', or internal regulatory sentences as sometimes found in 
10 introns, contains sequences required for proper tissue and stage specific expression. 

The nucleic acid compositions of the subject invention may encode all or a part of the 
subject PDE interacting proteins and polypeptides, described in greater detail infra. Double 
or single stranded fragments may be obtained from the DNA sequence by chemically ' 
synthesizing oligonucleotides in accordance with conventional methods, by restriction enzyme 
1 5 digestion, by PCR amplification, etc. For the most part, DNA fragments will be of at least 
15 nt, usually at least 18 nt or 25 nt, and may be at least about 50 nt. 

The genes of the subject invention are isolated and obtained in substantial purity, 
generally as other than an intact chromosome. Usually, the DNA will be obtained 
substantially free of other nucleic acid sequences that do not include a sequence encoding a 
20 PDE interacting protein or fragment thereof, generally being at least about 50%, usually at 
least about 90% pure and are typically "recombinant," i.e. flanked by one or more nucleotides 
with which it is not normally associated on a naturally occurring chromosome. 

In addition to the plurality of uses described in greater detail in following sections, the 
subject nucleic acid compositions find use in the preparation of all or a portion of the PDE 
25 interacting polypeptides, as described below. 

Polypeptide Compositions 

Also provided by the subject invention are PDE interacting proteins and polypeptides, i.e. 
proteins and polypeptides that are capable of binding to and modulating PDEs, specifically 
30 cAMP-PDEs, and more particularly cAMP-PDE4 isoforms, such as PDE4A, PDE4B, PDE4C, 
PDE4D, and the like. 
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The term polyeptide composition as used herein refers to both the full length proteins 
as well as portions or fragments thereof. Also included in this term are variations of the 
naturally occurring proteins, where such variations are homologous or substantially similar to 
the naturally occurring protein, as described in greater detail below, be the naturally occurring 
protein the human protein, rat protein, or protein from some other species which naturally 
expresses an PDE interacting protein, usually a mammalian species. In the following 
description of the subject invention, the term PDE interacting protein is used to refer not only 
to the human form of such proteins, but also to homologs thereof expressed in non-human 
species, e.g. murine, rat and other mammali an species. 

The subject PDE proteins are, in their natural environment, capable of modulating the 
form/function of PDEs, as well as targeting PDEs to specific subcellular compartments within 
a cell. In many embodiments, the subject PDE interacting proteins serve as PDE anchoring 
proteins. 

In many embodiments, the subject proteins are characterized by the presence of one or 
more coiled domains and leucine zippers. Furthermore, in certain embodiments, e.g. certain 
rat myomegalin proteins, the subject proteins have a region of high homology with 
Drosophila centrosomin, whereby high homology is meant at least about 30, usually at least 
about 40 % sequence identity. 

In many embodiments, the proteins range in length from about 1500 to 3000, usually 
from about 1600 to 2800 and more usually from about 1650 to 2600 amino acid residues, and 
the projected molecular weight of the subject proteins based solely on the number of amino 
acid residues in the protein ranges from about 150 to 320, usually from about 160 to 300 
kDa, where the actual molecular weight may vary depending on the amount of glycolsylation, 
if any, of the protein and the apparent molecular weight may be considerably less (40 to 50 
kDa) due to SDS binding on gels. On other embodiments, the length of the proteins may be 
much smaller, e.g. as in the case of splice variants or post translated products, where the 
length in these proteins may be as short as 40%, usually no shorter than about 50% of the 
above lengths. 

Of particular interest in many embodiments are proteins that are non-naturally 
glycosylated. By non-naturally glycosylated is meant that the protein has a glycosylation 
pattern, if present, which is not the same as the glycosylation pattern found in the 
corresponding naturally occurring protein. For example, a human phosphodiesterase binding 
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protein of the subject invention and of this particular embodiment is characterized by having a 
glycosylation pattern, if it is glycosylated at all, that differs from that of naturally occurring 
human PDE binding protein. Thus, the non-naturally glycosylated PDE interacting or binding 
proteins of this embodiment include non-glycosylated PDE interacting proteins, i.e. proteins 
having no covalently bound glycosyl groups. 

A PDE interacting protein of the subject invention of particular interest is 
myomegalin, particularly mammalian myomegalin and more particuariy, rat or human 
myomegalin. In many embodiments, mammalian myomegalin ranges in length from about 
2000 to 3000, usually from about 2700 to 2800 and more usually from about 2300 to 2600 aa 
residues. The projected molecular weight of these myomegalin proteins based solely on the 
number of amino acid residues in the protein ranges from about 220 to 320, usually from 
about 220 to 300 and more usually from about 240 to 300 kDa, where the actual molecular 
weight may vary depending on the amount of glycolsyiation, if any, of the protein and the 
apparent molecular weight may be considerably less (40 to 50 kDa) due to SDS binding on 
1 5 gels. Also of interest are mammalian myomegalin proteins that are shorter than those 
described above, where these shorter proteins could be splice variants or the products of 
post-translational activity, and the like. 

Of particular interest in certain embodiments is the rat myomegalin protein, where the 
rat myomegalin protein of the subject invention has an amino acid sequence that is 
20 substantially the same as or identical to the sequence appearing as SEQ ID NO:02 infra and 
appearing in Figure 1 By substantially the same as is meant a protein having a sequence that 
has at least about 80%, usually at least about 90% and more usually at least about 98% 
sequence identity with the sequence of SED ID NO:02. Also of particular interest is an 
approximately 65 kDa rat myomegalin protein expressed in rat testis. Yet another protein of 
25 particular interest is the human myomegalin protein of the subject invention which has an 

amino acid sequence that is substantially the same as or identical to the sequence appearing as 
SEQ ID NO: 05 infra and appearing in Figure 5. By substantially the same as is meant a 
protein having a sequence that has at least about 80%, usually atleast about 90% and more 
usually at least about 98% sequence identity with the sequence of SED ID NO: 05. 
30 Another PDE interacting protein of the subject invention of particular interest is M 1 4, 

particularly m a mm a li an M 1 4, and more particularly, rat or human M 1 4. In many 
embodiments, m a mm alian M14 ranges in length from about 1500 to about 2000, usually from 
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about 1600 to about 1800, usually from about 1650 to about 1700, and more usually from 
about 1670 to about 1690 amino acid residues. The projected molecular weight of these M14 
polypeptides, based solely on the number of amino acid residues in the protein, ranges from 
about 150 to about 200 kDa, usually from about 160 to about 180 kDa, usually from about 
165 to about 170 kDa. Rat Ml 4 protein has a mobility on SDS-PAGE of about 185 kDa. 
The actual molecular weight may vary depending on the amount of glycosylation or other 
post-translational modifications, if any, of the protein, and the apparent molecular weight may 
be considerably less (e.g. 40-50 kDa) due to SDS binding on gels. Also of interest are PDE- 
interacting fragments of the above-described Ml 4 proteins. 

Of particular interest in certain embodiments is a rat M14 protein, where the rat M14 
protein of the subject invention has an amino acid sequence that is substantially the same or 
identical to the sequence set forth in SEQ ID NO: 08 and appearing in Figure 6. By 
substantially the same as is meant a protein having a sequence that has at least about 80%, 
usually at least about 90% and more usually at least about 98% sequence identity with the 
sequence ofSED ID NO:08. Proteins homologous to rat M14 are also of interest, including, 
e.g., anEse2L protein as described in Sengar et al (\999) EMBO J. 18:1159-1171. 

Also of interest are huntingtin interacting proteins, and PDE-interacting fragments, 
variants and homologs thereof. In some embodiments, huntingtin interacting protein (HIP) is 
a hum a n HIP1 protein having an amino acid sequence as disclosed in GenBank Accession No. 
U79734, The human HIP 1 protein is described in Kalcfaman et al. (1997) Nature Genetics 
16:44-53. 

In addition to the specific PDE interacting proteins described above, homologs or 
proteins (or fragments thereof) from other species, i.e. other animal or plant species, are also 
provided, where such homologs or proteins may be from a variety of different types of 
species, usually m a mm a l s, e.g. rodents, such as mice, rats; domestic animals, e.g. horse, cow, 
dog, cat; and humans. By homolog is meant a protein having at least about 35 %, usually at 
least about 40% and more usually at least about 60 % amino acid sequence identity with a 
specific PDE interacting protein as identified in: (a) SEQ ID NO: 02 and appearing in Figure 
1; or (b) SEQ ID NO:05 and appearing in Figure 5; or (c) SEQ ID NO:08 and appearing in 
Figure 6. 

The PDE interacting proteins of the subject invention (e.g. human myomegalin, rat 
myomegalin or homologs thereof) are present in a non-naturally occurring environment, e.g. 
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are separated from their naturally occurring environment. In certain embodiments, the subject 
protein is present in a composition that is enriched for the subject protein as compared to the 
protein in its naturally occurring environment. As such, purified PDE interacting protein is 
provided, where by purified is meant that PDE interacting protein is present in a composition 
5 that is substantially free of non PDE interacting proteins, where by substantially free is meant 
that less than 90 %, usually less than 60 % and more usually less than 50 % of the 
composition is made up of non-PDE interacting proteins. 

In certain embodiments of interest, the PDE interacting protein is present in a 
composition that is substantially free of the constituents that are present in its naturally 
1 0 occurring environment. For example, a human PDE interacting protein comprising 

composition according to the subject invention in this embodiment will be substantially, if not 
completely, free of those other biological constituents, such as proteins, carbohydrates, lipids, 
etc., with which it is present in its natural environment. As such, protein compositions of 
these embodiments will necessarily differ from those that are prepared by purifying the protein 
from a naturally occurring source, where at least trace amounts of the protein's constituents 
will still be present in the composition prepared from the naturally occurring source. 

The PDE interacting protein of the subject invention may also be present as an isolate, 
by which is meant that the PDE interacting protein is substantially free of both non-PDE 
interacting proteins and other naturally occurring biologic molecules, such as 
oligosaccharides, polynucleotides and fragments thereof, and the like, where substantially free 
in this instance means that less than 70 %, usually less than 60% and more usually less than 
50 % of the composition containing the isolated PDE interacting protein is a non-PDE 
interacting protein naturally occurring biological molecule. In certain embodiments, the 
subject protein is present in substantially pure form, where by substantially pure form is meant 
at least 95%, usually at least 97% and more usually at least 99% pure. 

In addition to the naturally occurring proteins, polypeptides which vary from the 
naturally occurring proteins are also provided. By polypeptides is meant proteins having an 
amino acid sequence encoded by an open reading frame (ORF) of an gene according to the 
subject invention, described supra, including the full length protein and fragments thereof, 
particularly biologically active fragments and/or fragments corresponding to functional 
domains; and including fusions of the subject polypeptides to other proteins or parts thereof. 
Fragments of interest will typically be at least about 10 aa in length, usually at least about 50 
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aa in length, and may be as long as 300 aa in length or longer, but will usually not exceed 
about 1000 aa in length, where the fragment will have a stretch of amino acids that is identical 
to the protein of SEQ ID NO:02, SEQ ID NO:05, or SEQ ID NO:08, or a homolog thereof; 
of at least about 10 aa, and usually at least about 15 aa, and in many embodiments at least 
about 50 aa in length. 

Preparation of PDE Interacting Polypeptides 

The subject PDE interacting proteins and polypeptides may be obtained from naturally 
occurring sources or synthetically produced. Wnere obtained from naturally occurring 
sources, the source chosen will generally depend on the species from which the PDE 
interacting protein is to be derived, e.g. muscle tissue, heart tissue, brain tissue, testis tissue, 
and the like. 

The subject PDE interacting polypeptide compositions may be synthetically derived by 
expressing a recombinant gene encoding the PDE interacting protein, such as the 
polynucleotide compositions described above, in a suitable host. For expression, an 
expression cassette may be employed. The expression vector will provide a transcriptional 
and translationai initiation region, which may be inducible or constitutive, where the coding 
region is operably linked under the transcriptional control of the transcriptional initiation 
region, and a transcriptional and translationai termination region. These control regions may 
be native to the gene encoding the particular PDE interacting protein, or may be derived from 
exogenous sources. 

Expression vectors generally have convenient restriction sites located near the 
promoter sequence to provide for the insertion of nucleic acid sequences encoding 
heterologous proteins. A selectable marker operative in the expression host may be present. 
Expression vectors may be used for the production of fusion proteins, where the exogenous 
fusion peptide provides additional functionality, i.e. increased protein synthesis, stability, 
reactivity with defined antisera, an enzyme marker, e.g. (J-galactosidase, etc. 

Expression cassettes may be prepared comprising a transcription initiation region, the 
gene or fragment thereof, and a transcriptional termination region. Of particular interest is the 
use of sequences that allow for the expression of functional epitopes or domains, usually at 
least about 8 amino acids in length, more usually at least about 1 5 amino acids in length, to 
about 25 amino acids, and up to the complete open reading frame of the gene. After 
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introduction of the DNA, the cells containing the construct may be selected by means of a 
selectable marker, the cells expanded and then used for expression. 

The subject proteins and polypeptides may be expressed in prokaryotes or eukaryotes 
in accordance with conventional ways, depending upon the purpose for expression. For large 
scale production of the protein, a unicellular organism, such as E. coli, B. subtilis, 
S. cerevisiae, insect cells in combination with baculovirus vectors, or cells of a higher 
organism such as vertebrates, particularly mammals, e.g. COS 7 cells, may be used as the 
expression host cells. In some situations, it is desirable to express the subject proteins in 
eukaryotic cells, where the protein wiil benefit from native folding and post-transktional 
modifications. Small peptides can also be synthesized in the laboratory. Polypeptides that are 
subsets of the complete protein sequence may be used to identify and investigate parts of the 
protein important for function. 

Once the source of the protein is identified and/or prepared, e.g. a transfected host 
expressing the protein is prepared, the protein is then purified to produce the desired PDE 
interacting protein comprising composition. Any convenient protein purification procedures 
may be employed, where suitable protein purification methodologies are described in Guide to 
Protein Purification, (Deuthser ed.) (Academic Press, 1990). For example, a lysate may be 
prepared from the original source, e.g. naturally occurring cells or tissues that express a PDE 
interacting protein or the expression host expressing the PDE interacting protein, and purified 
using HPLC, exclusion chromatography, gel electrophoresis, affinity chromatography, and the 
like. 

Uses of the Subject Polypeptide and Nucleic Acid Compositions 

The subject polypeptide and nucleic acid compositions find use in a variety of different 
applications, including diagnostic, and therapeutic agent screening/discovery/preparation 
applications, as well as the treatment of disease conditions associated with PDE interacting 
protein activity. 

General Applications 

The subject nucleic acid compositions find use in a variety of applications, including: 

(a) the identification of PDE interacting protein gene homologs, e.g. myomegalin homologs; 

(b) as a source of novel promoter elements; (c) the identification of PDE interacting protein 
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expression regulatory factors; (d) as probes and primers in hybridization applications, e.g. 
PCR; (e) the identification of expression patterns in biological specimens; (f) the preparation 
of cell or animal models for PDE interacting protein function; (g) the preparation of in vitro 
models for PDE interacting protein function; etc. 

Identification of homology 

Homologs of the PDE interacting protein gene, e.g. the myomegalin gene, or the 
M14 gene, are identified by any of a number of methods. A fragment of the provided cDNA 
may be used as a hybridization probe against a cDNA library from the target organism of 
interest, where low stringency conditions are used. The probe may be a large fragment, or one 
or more short degenerate primers. Nucleic acids having sequence similarity are detected by 
hybridization under low stringency conditions, for example, at 50°C and 6*SSC (0.9 M 
sodium chloride/0.09 M sodium citrate) and remain bound when subjected to washing at 
55°C in 1 *SSC (0. 15 M sodium chloride/0.015 M sodium citrate). Sequence identity may be 
determined by hybridization under stringent conditions, for example, at 50°C or higher and 
0. 1 *SSC (15 mM sodium chloride/01.5 mM sodium citrate). Nucleic acids having a region 
of substantial identity to the provided sequences, e.g. allelic variants, genetically altered 
versions of the gene, etc., bind to the provided sequences under stringent hybridization - 
conditions. By using probes, particularly labeled probes of DNA sequences, one can isolate 
homologous or related genes. 

Identification of Novel Pro moter Elements 

The sequence of the 5' flanking region may be utilized for promoter elements, 
including enhancer binding sites, that provide for regulation in tissues where the subject gene 
is expressed. The tissue specific expression is useful for deterrnining the pattern of expression, 
and for providing promoters that mimic the native pattern of expression. Naturally occurring 
polymorphisms in the promoter region are useful for deterrnining natural variations in 
expression, particularly those that may be associated with disease. 

Identification of Expression Repiilatory Fflfiforfi 

Alternatively, mutations may be introduced into the promoter region to determine the 
effect of altering expression in experimentally defined systems. Methods for the identification 
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of specific DNA motifs involved in the binding of transcriptional factors are known in the art, 
e.g. sequence similarity to known binding motifs, gel retardation studies, etc. For examples, 
see Blackwell et al (1995), Mol Med. 1: 194-205; Mortlock et al. (1996), Genome Res. 
6:327-33; and Joulin and Richard-Foy (1995), Eur. J. Biochem. 232:620-626. 

The regulatory sequences may be used to identify cis acting sequences required for 
transcriptional or translational regulation of expression of the subject gene, e.g. the 
myomegalm gene, especially in different tissues or stages of development, and to identify cis 
acting sequences and /reacting factors that regulate or mediate expression of the subject 
gene. Such transcription or translational control regions may be operabiy linked to a gene of 
the subject invention in order to promote expression of wild type or altered PDE interacting 
protein, e.g. myomegalin, or other proteins of interest in cultured cells, or in embryonic, fetal 
or adult tissues, and for gene therapy. 

Probes and Primes. 

Small DNA fragments are useful as primers for PCR, hybridization screening probes, 
etc. Larger DNA fragments, i.e. greater than 100 nt are useful for production of the encoded 
polypeptide, as described in the previous section. For use in amplification reactions, such as 
PCR, a pair of primers will be used. The exact composition of the primer sequences is not 
critical to the invention, but for most applications the primers will hybridize to the subject 
sequence under stringent conditions, as known in the art. It is preferable to choose a pair of 
primers that will generate an amplification product of at least about 50 nt, preferably at least 
about 100 nt. Algorithms for the selection of primer sequences are generally known and are 
available in commercial software packages. Amplification primers hybridize to 
complementary strands of DNA, and will prime towards each other. 

Identification of Expression Patterns in Biological Spe cifflfinft 

The DNA may also be used to identify expression of the gene in a biological 
specimen. The manner in which one probes cells for the presence of particular nucleotide 
sequences, as genomic DNA or RNA, is well established in the literature. Briefly, DNA or 
mRNA is isolated from a cell sample. The mRNA may be amplified by RT-PCR, using reverse 
transcriptase to form a complementary DNA strand, followed by polymerase chain reaction 
amplification using primers specific for the subject DNA sequences. Alternatively, the mRNA 
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sample is separated by gel electrophoresis, transferred to a suitable support, e.g. 
nitrocellulose, nylon, etc., and then probed with a fragment of the subject DNA as a probe. 
Other techniques, such as oligonucleotide ligation assays, in situ hybridizations, and 
hybridization to DNA probes arrayed on a solid chip may also find use. Detection of mRNA 
hybridizing to the subject sequence is indicative of gene expression in the sample. 

The Preparation of PDF Tnteractinp Pmt^ n Mi i t a nt f i 

The sequence of a gene according to the subject invention, including flanking 
promoter regions and coding regions, may be mutated in various ways known in the art tc 
generate targeted changes in promoter strength, sequence of the encoded protein, etc. The 
DNA sequence or protein product of such a mutation will usually be substantially similar to 
the sequences provided herein, i.e. will differ by at least one nucleotide or amino acid, 
respectively, and may differ by at least two but not more than about ten nucleotides or amino 
acids. The sequence changes may be substitutions, insertions, deletions, or a combination 
thereof. Deletions may further include larger changes, such as deletions of a domain or exon. 
Other modifications of interest include epitope tagging, e.g. with the FLAG system, HA, etc. 
For studies of subcellular localization, fusion proteins with green fluorescent proteins (GFP) 
may be used. 

Techniques for in vitro mutagenesis of cloned genes are known. Examples of 
protocols for site specific mutagenesis may be found in Gustin ei al (1993), Biotechniques 
14:22; Barany (1985), Gene 37:111-23; Coliceili et al. (1985)MoL Gen. Genet 199:537-9; 
and Prentki et al. (1984), Gene 29:303-13. Methods for site specific mutagenesis can be 
found in Sambrook et al, Molecular Cloning: A laboratory Manual, CSH Press 1989, pp. 
15.3-15.108; Weiner etaL (1993), Gene 126:35-41; Sayers etal. (1992), Biotechniques 
13:592-6; Jones and Winistorfer (1992), Biotechniques 12:528-30; Barton etal. (1990), 
Nucleic Acids Res 18:7349-55; Marotti and Tomich (1989), Gene Anal. Tech. 6:67-70; and 
Zhu (1989), AnalBiochem 177:120-4. Such mutated genes may be used to study structure- 
function relationships ofPDE interacting proteins, or to alter properties of the protein that 
affect its function or regulatioa 
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Production Of In Vivo Models of PDF TntP r acting Prnt*i n y ur] ^\ m 

The subject nucleic acids can be used to generate transgenic, non-human animals or 
site specific gene modifications in cell lines. Transgenic animals may be made through 
homologous recombination, where the normal PDE interacting protein gene locus is altered. 
Alternatively, a nucleic acid construct is randomly integrated into the genome. Vectors for 
stable integration include plasmids, retroviruses and other animal viruses, YACs, and the like. 

The modified cells or animals are useful in the study of PDE interacting protein 
function and regulation. For example, a series of small deletions and/or substitutions may be 
made in the host's native PDE interacting protein gene to determine the role of different exons 
in cholesterol metabolism, e.g. cholesterol ester synthesis, cholesterol absorption, etc. 
Specific constructs of interest include anti-sense constructs which will block PDE interacting 
protein expression, expression of dominant negative gene mutations, and over-expression of 
PDE interacting protein genes. Where a particular genetic sequence is introduced, the 
introduced sequence may be either a complete or partial sequence of an PDE interacting 
protein gene native to the host, or may be a complete or partial sequence that is exogenous to 
the host a nim al, e.g., a human sequence. A detectable marker, such as lac Z, may be 
introduced into the locus, where upregulation of gene expression will result in an easily 
detected change in phenotype. 

One may also provide for expression of the gene or variants thereof in cells or tissues 
where it is not normally expressed, at levels not normally present in such cells or tissues, or at 
abnormal times of development. 

DNA constructs for homologous recombination will comprise at least a portion of the 
gene native to the species of the host animal, wherein the gene has the desired genetic 
modification(s), and includes regions of homology to the target locus. DNA constructs for 
random integration need not include regions of homology to mediate recombination. 
Conveniently, markers for positive and negative selection are included. Methods for 
generating cells having targeted gene modifications through homologous recombination are 
known in the art. For various techniques for transfecting mammalian cells, see Keown et al 
(1990), Meth. Enzymol. 185:527-537. 

For embryonic stem (ES) cells, an ES cell line may be employed, or embryonic cells 
may be obtained freshly from a host, e.g. mouse, rat, guinea pig, etc. Such cells are grown on 
an appropriate fibroblast-feeder layer or grown in the presence of leukemia inhibiting factor 
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(LIF). When ES or embryonic ceils have been transformed, they may be used to produce 
transgenic animals. After transformation, the cells are plated onto a feeder layer in an 
appropriate medium. Cells containing the construct may be detected by employing a selective 
medium. After sufficient time for colonies to grow, they are picked and analyzed for the 
occurrence of homologous recombination or integration of the construct. Those colonies that 
are positive may then be used for embryo manipulation and blastocyst injection. Blastocysts 
are obtained from 4 to 6 week old superovulated females. The ES cells are trypsinized, and 
the modified cells are injected into the blastocoel of the blastocyst. After injection, the 
blastocysts are returned to each uterine horn of pseudopregnant females. Females are then 
allowed to go to term and the resulting offspring screened for the construct. By providing for 
a different phenotype of the blastocyst and the genetically modified cells, chimeric progeny 
can be readily detected. 

The chimeric animals are screened for the presence of the modified gene and males 
and females having the modification are mated to produce homozygous progeny. If the gene 
alterations cause lethality at some point in development, tissues or organs can be maintained 
as allogeneic or congenic grafts or transplants, or in in vitro culture. The transgenic animals 
may be any non-human mammal, such as laboratory animals, domestic animals, etc. The 
transgenic animals may be used in functional studies, drug screening, etc., e.g. to determine 
the effect of a candidate drug on PDE interacting binding protein activity and/or the 
enzymatic activity of the PDE/PDE interacting protein complex. 



Production Of//? Vitro Models of PDF In fracting Protein Fun^o n 

One can also use the polypeptide compositions of the subject invention to produce in 
vitro models of PDE interacting protein function. In addition to the subject PDE interacting 
protein, such models will generally include at least a PDE as well as a cyclic nucleotide, and a 
means to monitor the activity of the enzyme in the presence of the PDE interacting protein, 
e.g. a labeled isotope, etc. 

Diagnostic Applications 

Also provided are methods of diagnosing disease states associated with PDE 
interacting protein activity, e.g. based on observed levels of PDE interacting protein or the 
expression level of the gene in a biological sample of interest. Samples, as used herein, include 
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biological fluids such as semen, blood, cerebrospinal fluid, tears, saliva, lymph, dialysis fluid 
and the like; organ or tissue culture derived fluids; and fluids extracted from physiological 
tissues. Also included in the term are derivatives and fractions of such fluids. The cells may 
be dissociated, in the case of solid tissues, or tissue sections may be analyzed. Alternatively a 
lysate of the cells may be prepared. 

A number of methods are available for determining the expression level of a gene or 
protein in a particular sample. Diagnosis may be performed by a number of methods to 
determine the absence or presence or altered amounts of normal or abnormal PDE interacting 
protein in a patient sample. For example, detection may utilize staining of cells or histological 
sections with labeled antibodies, performed in accordance with conventional methods. Cells 
are permeabilized to stain cytoplasmic molecules. The antibodies of interest are added to the 
cell sample, and incubated for a period of time sufficient to allow binding to the epitope, 
usually at least about 10 minutes. The antibody may be labeled with radioisotopes, enzymes, 
fluoresces, cheinmiminescers, or other labels for direct detection. Alternatively, a second 
stage antibody or reagent is used to amplify the signal. Such reagents are well known in the 
art. For example, the primary antibody may be conjugated to biotin, with horseradish 
peroxidase-conjugated avidin added as a second stage reagent. Alternatively, the secondary 
antibody conjugated to a flourescent compound, e.g. fluorescein, rhodamine, Texas red, etc. 
Final detection uses a substrate that undergoes a color change in the presence of the 
peroxidase. The absence or presence of antibody binding may be determined by various 
methods, including flow cytometry of dissociated cells, microscopy, radiography, scintillation 
counting, etc. 

Alternatively, one may focus on the expression of the gene. Biochemical studies may 
be performed to determine whether a sequence polymorphism in an coding region or control 
regions is associated with disease. Disease associated polymorphisms may include deletion or 
truncation of the gene, mutations that alter expression level, that affect the activity of the 
protein, etc. 

Changes in the promoter or enhancer sequence that may affect expression levels of 
the gene can be compared to expression levels of the normal allele by various methods known 
m the art. Methods for determining promoter or enhancer strength include quantitation of the 
expressed natural protein; insertion of the variant control element into a vector with a 
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reporter gene such as p-galactosidase, luciferase, chloramphenicol acetyltransferase, etc. that 
provides for convenient quantitation; and the like. 

A number of methods are available for analyzing nucleic acids for the presence of a 
specific sequence, e.g. a disease associated polymorphism. Where large amounts of DNA are 
available, genomic DNA is used directly. Alternatively, the region of interest is cloned into a 
suitable vector and grown in sufficient quantity for analysis. Cells that express the subject 
gene may be used as a source of mRNA, which may be assayed directly or reverse transcribed 
into cDNA for analysis. The nucleic acid may be amplified by conventional techniques, such 
as the polymerase chain reaction (PCR), to provide sufficient amounts for analysis. The use 
of the polymerase chain reaction is described in Saiki, etal (1985), Science 239:487, and a 
review of techniques may be found in Sambrook, et al Molecular Cloning- A T-ahpn umy 
Manual CSH Press 1989, pp. 14.2B 14.33. Alternatively, various methods are known in the 
art that utilize oligonucleotide ligation as a means of detecting polymorphisms, for examples 
see Riley et al (1990), Nucl Acids Res. 18:2887-2890; and Delahunty et al (1996), Am. J. 
Hum. Genet 58:1239-1246. 

A detectable label may be included in an amplification reaction. Suitable labels include 
fluorochromes, e.g. fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin, 
aUophycocyanin, 6-carboxyfluorescein (6-FAM), 2 , ,7'-dimethoxy-4 , ,5 , -dichloro-6- 
carboxyfluorescein(JOE), 6-carboxy-X-rhodamine (ROX), 6-carboxy-2 , ,4 , ,7 , ,4,7- 
hexachlorofluorescein (HEX), 5-carboxyfluorescein (5-FAM) or N,N,N\N^tetramethyi-6- 
carboxyrhodamine (TAMRA), radioactive labels, e.g. 32 P, 3S S, 3 H; etc. The label may be a 
two stage system, where the amplified DNA is conjugated to biotin, haptens, etc. having a 
high affinity binding partner, e.g. avidin, specific antibodies, etc., where the-binding partner is 
conjugated to a detectable label. The label may be conjugated to one or both of the primers. 
Alternatively, the pool of nucleotides used in the amplification is labeled, so as to incorporate 
the label into the amplification product. 

The sample nucleic acid, e.g. amplified or cloned fragment, is analyzed by one of a 
number of methods known in the art. The nucleic acid may be sequenced by dideoxy or other 
methods, and the sequence of bases compared to a wild-type sequence. Hybridization with 
the variant sequence may also be used to determine its presence, by Southern blots, dot blots, 
etc. The hybridization pattern of a control and variant sequence to an array of 
oligonucleotide probes immobilized on a solid support, as described in US 5,445,934, or in 
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WO 95/35505, may also be used as a means of detecting the presence of variant sequences. 
Single strand conformational polymorphism (SSCP) analysis, denaturing gradient gel 
electrophoresis (DGGE), and heteroduplex analysis in gel matrices are used to detect 
conformational changes created by DNA sequence variation as alterations in electrophoretic 
mobility. Alternatively, where a polymorphism creates or destroys a recognition site for a 
restriction endonudease, the sample is digested with that endonuclease, and the products size 
fractionated to determine whether the fragment was digested. Fractionation is performed by 
gel or capillary electrophoresis, particularly acrylamide or agarose gels. 

Screening for mutations may be based on the functional or antigenic characteristics of 
the protein. Protein truncation assays are useful in detecting deletions that may affect the 
biological activity of the protein. Various immunoassays designed to detect polymorphisms in 
the subject PDE interacting proteins may be used in screening. Where many diverse genetic 
mutations lead to a particular disease phenotype, functional protein assays have proven to be 
effective screening tools. The activity of the encoded protein may be determined by 
comparison with the wild-type protein- 
Diagnostic methods of the subject invention in which the level of expression is of 
interest will typically involve comparison of the PDE interacting protein nucleic acid 
abundance of a sample of interest with that of a control value to determine any relative 
differences, where the difference may be measured qualitatively and/or quantitatively, which 
differences are then related to the presence or absence of an abnormal gene expression 
pattern. A variety of different methods for deterrnining the nucleic acid abundance in a sample 
are known to those of skill in the art, where particular methods of interest include those 
described in: Pietu et al., Genome Res. (June 1996) 6: 492-503; Zhao et al., Gene (April 24, 
1995) 156: 207-213; Soares , Curr. Opin. Biotechnol. (October 1997) 8: 542-546; Ravai, J. 
Pharmacol Toxicol Methods (November 1994) 32: 125-127; Chalifour et al., Anal. Biochem 
(February 1, 1994) 216: 299-304; Stolz & Tuan, Mol. Biotechnol. (December 19960 6: 225- 
230; Hong et al., Bioscience Reports (1982) 2: 907; and McGraw, Anal. Biochem. Q984) 
143: 298. Also of interest are the methods disclosed in WO 97/27317, the disclosure of which 
is herein incorporated by reference. 
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Screening Assays 

The subject PDE interacting proteins and polypeptides find use in various screening 
assays designed to identify therapeutic agents. The screening assays may be designed to 
identify agents that modulate, e.g. inhibit or enhance, the activity of the PDE interacting 
5 protein directly and thereby modulate the activity of the particular PDE that depends on the 
presence of the PDE interacting protein for its function. Alternatively, the assay may be 
designed to identify those agents that modify, e.g. enhance or inhibit, the activity of the PDE 
when present as a complex with the PDE interacting protein. 

Of particular interest are screening methods that provide for qualitative/quantitative 
1 0 measurements of a PDE enzyme activity in the presence of a particular candidate therapeutic 
agent and its PDE interacting protein, as such screening methods are capable of identifying 
highly selective PDE modulatory, e.g. inhibitory, agents. For example, the assay could be an 
assay which measures the activity of a PDE interacting protein/enzyme complex in the 
presence and absence of a candidate inhibitor agent. In this preferred screening assay 
1 5 embodiment, the PDE interacting protein/PDE complex will generally be a naturally occurring 
complex, i.e. a complex between a cyclic nucleotide PDE and its naturally occurring PDE 
interacting protein partner. Of particular interest are complexes between a cAMP-PDEIV and 
a myoxnegalin protein. 

The screening method may be an in vitro or in vivo format, where both formats are 
20 readily developed by those of skill in the art. Depending on the particular method, one or 
more of, usually one o£ the components of the screening assay may be labeled, where by 
labeled is meant that the components comprise a detectable moiety, e.g. a fluorescent or 
radioactive tag, or a member of a signal producing system, e.g. biotin for binding to an 
enzyme-streptavidin conjugate in which the enzyme is capable of converting a substrate to a 
25 chromogenic product. 

A variety of other reagents may be included in the screening assay. These include 
reagents like salts, neutral proteins, e.g. albumin, detergents, etc. that are used to facilitate 
optimal protein-protein binding and/or reduce non-specific or background interactions. 
Reagents that improve the efficiency of the assay, such as protease inhibitors, nuclease 
30 inhibitors, anu-microbial agents, etc. may be used. Specific PDE activity assays of interest 
include those described in U.S. Patent Nos. 5,798,373 and 5,580,888, the disclosures of 
which are herein incorporated by reference. 
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A variety of different candidate agents may be screened by the above methods. 
Candidate agents encompass numerous chemical classes, though typically they are organic 
molecules, preferably small organic compounds having a molecular weight of more than 50 
and less than about 2,500 daltons. Candidate agents comprise functional groups necessary 
for structural interaction with proteins, particularly hydrogen bonding, and typically include at 
least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the functional 
chemical groups. The candidate agents often comprise cyclical carbon or heterocyclic 
structures and/or aromatic or polyaromatic structures substituted with one or more of the 
above functional groups. Candidate agents are also found among biomoiecuies including 
peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural 
analogs or combinations thereof 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random and 
directed synthesis of a wide variety of organic compounds and biomoiecuies, including 
expression of randomized oligonucleotides and oligopeptides. Alternatively, libraries of 
natural compounds in the form of bacterial, fungal, plant and animal extracts are available or 
readily produced. Additionally, natural or synthetically produced libraries and compounds are 
readily modified through conventional chemical, physical and biochemical means, and may be 
used to produce combinatorial libraries. Known pharmacological agents may be subjected to 
directed or random chemical modifications, such as acylation, alkyiation, esterification, 
amidification, etc. to produce structural analogs. 



PDE Interacting Protein Nucleic Acid and Polypeptide Therapeutic Compositions 

The nucleic acid compositions of the subject invention also find use as therapeutic 
agents in situations where one wishes to enhance the PDE interacting protein activity in a 
host, e.g. in a mamma l i a n host in which PDE interacting protein activity is sufficiently low 
such that a disease condition is present, etc. The PDE interacting protein genes, gene 
fragments, or the encoded proteins or protein fragments are useful in gene therapy to treat 
disorders associated with defects the PDE interacting protein gene expression. Expression 
vectors may be used to introduce the gene into a ceU. Such, vectors generally have convenient 
restriction sites located near the promoter sequence to provide for the insertion of nucleic 
acid sequences. Transcription cassettes may be prepared comprising a transcription initiation 
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region, the target gene or fragment thereof, and a transcriptional termination region. The 
transcription cassettes may be introduced into a variety of vectors, e.g. plasmid; retrovirus, 
e.g. ientivirus; adenovirus; and the like, where the vectors are able to transiently or stably be 
maintained in the cells, usually for a period of at least about one day, more usually for a 
5 period of at least about several days to several weeks. 

The gene or protein may be introduced into tissues or host cells by any number of 
routes, including viral infection, microinjection, or fusion of vesicles. Jet injection may also be 
used for intramuscular administration, as described by Furth et ai. (1992), Anal Biochem 
205:365-368. The DNAmay be coated onto gold microparticles, and delivered intradermal!} 
by a particle bombardment device, or "gene gun" as described in the literature (see, for 
example, Tang et ai (1992), Nature 356: 152-154), where gold microprojectiles are coated 
with the DNA, then bombarded into skin cells. 

Methods of Modulating PDE Interacting Protein Activity in a. Host 

Also provided are methods of regulating, including enhancing and inhibiting, PDE 
interacting protein activity in a host. Where the PDE interacting protein activity occurs in 
vivo in a host, an effective amount of active agent that modulates the activity, e.g. reduces the 
activity, of the PDE interacting protein in vivo (e.g. the activity of the naturally occurring 
PDE/interacting protein complex), is . administered to the host. The active agent may be a 
variety of different compounds, including a naturally occurring or synthetic small molecule 
compound, an antibody, fragment or derivative thereof; an antisense composition, and the 
like. 

Naturally occurring or synthetic small molecule compounds of interest include 
numerous chemical classes, though typically they are organic molecules, preferably small 
organic compounds having a molecular weight of more than 50 andless than about 2,500 
daltons. Candidate agents comprise functional groups necessary for structural interaction 
with proteins, particularly hydrogen bonding, and typically include at least an amine, carbonyl, 
hydroxy] or carboxyl group, preferably at least two of the functional chemical groups. The 
candidate agents often comprise cyclical carbon or heterocyclic structures and/or aromatic or 
polyaromatic structures substituted with one or more of the above functional groups. 
Candidate agents are also found among biomolecules including peptides, saccharides, fatty 
acids, steroids, purines, pyrimidines, derivatives, structural analogs or combinations thereof. 
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Also of interest as active agent are antibodies that modulate, e.g. reduce, if not inhibit, 
PDE interacting protein activity in the host. Suitable antibodies are obtained by immunizing a 
host animal with peptides comprising all or a portion of the subject proteins, such as found in 
the polypeptide compositions of the subject invention. Suitable host animals include mouse, 
rat sheep, goat, hamster, rabbit, etc. The origin of the protein immunogen may be mouse, 
human, rat, monkey etc. The host animal will generally be a different species than the 
immunogen, e.g. human protein used to immunize mice, etc. 

The immunogen may comprise the complete protein, or fragments and derivatives 
thereof. Preferred immunogens comprise ail or a part of the PDE interacting protein, where 
these residues contain the post-translation modifications, such as glycosylation, found on the 
native protein. Immunogens comprising the extracellular domain are produced in a variety of 
ways known in the art, e.g. expression of cloned genes using conventional recombinant 
methods, isolation from HEC, etc. 

For preparation of polyclonal antibodies, the first step is immunization of the host 
animal with the immunogen, where the immunogen will preferably be in substantially pure 
form, comprising less than about 1% contaminant. The immunogen may comprise complete 
PDE interacting protein, fragments or derivatives thereof. To increase the immune response 
of the host animal, the protein or peptide may be combined with an adjuvant, where suitable 
adjuvants include alum, dextran, sulfate, large polymeric anions, oil & water emulsions, e.g. 
Freund's adjuvant, Freund's complete adjuvant, and the like. The immunogen may also be 
conjugated to synthetic carrier proteins or synthetic antigens. A variety of hosts may be 
immunized to produce the polyclonal antibodies. Such hosts include rabbits, guinea pigs, 
rodents, e.g. mice, rats, sheep, goats, and the like. The immunogen is administered to the 
host, usually intradermal^, with an initial dosage followed by one or more, usually at least 
two, additional booster dosages. Following immunization, the blood from the host will be 
collected, followed by separation of the serum from the blood cells. The Ig present in the 
resultant antiserum may be further fractionated using known methods, such as ammonium salt 
fractionation, DEAE chromatography, and the like. 

Monoclonal antibodies are produced by conventional techniques. Generally, the 
spleen and/or lymph nodes of an immunized host animal provide a source of plasma cells. 
The plasma cells are immortalized by fusion with myeloma cells to produce hybridoma cells. 
Culture supernatant from individual hybridomas is screened using standard techniques to 
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identify those producing antibodies with the desired specificity. Suitable animals for 
production of monoclonal antibodies to the human protein include mouse, rat, hamster, etc. 
To raise antibodies against the mouse protein, the animal will generally be a hamster, guinea 
pig, rabbit, etc. The antibody may be purified from the hybridoma cell supernatants or ascites 
fluid by conventional techniques, e.g. affinity chromatography using PDE-interacting protein 
bound to an insoluble support, protein A sepharose, etc. 

The antibody may be produced as a single chain, instead of the normal multimeric 
structure. Single chain antibodies are described in Jost et al. (1994) ULSL 269:26267-73, 
and others. DNA sequences encoding the variable region of the heavy chain and the variable 
region of the light chain are ligated to a spacer encoding at least about 4 amino acids of small 
neutral amino acids, including glycine and/or serine. The protein encoded by this fusion 
allows assembly of a functional variable region that retains the specificity and affinity of the 
original antibody. 

For in vivo use, particularly for injection into humans, it is desirable to decrease the 
antigenicity of the antibody. An immune response of a recipient against the blocking agent 
will potentially decrease the period of time that the therapy is effective. Methods of 
humanizing antibodies are known in the art. The humanized antibody may be the product of 
an animal having transgenic human immunoglobulin constant region genes (see for example 
International Patent Applications WO 90/10077 and WO 90/04036). Alternatively, the 
antibody of interest may be engineered by recombinant DNA techniques to substitute the 
CHI, CH2, CH3, hinge domains, and/or the framework domain with the corresponding 
human sequence (see WO 92/02190). 

The use of Ig cDNA for construction of chimeric immunoglobulin genes is known in 
the art (Liu e/ a/. (1987) EfcLAJL 84:3439 and (1 987) IJmmanol 139:3521). mRNAis 
isolated from a hybridoma or other cell producing the antibody and used to produce cDNA. 
The cDNA of interest may be amplified by the polymerase chain reaction using specific 
primers (U.S. Patent nos. 4,683,195 and 4,683,202). Alternatively, a library is made and 
screened to isolate the sequence of interest. The DNA sequence encoding the variable region 
of the antibody is then fused to human constant region sequences. The sequences of human 
constant regions genes may be found in Kabat et al. (1991) Sequences of Proteins of 
Immunological Interest, N.I H. publication no. 91-3242. Human C region genes are readily 
available from known clones. The choice of isotype will be guided by the desired effector 
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functions, such as complement fixation, or activity in antibody-dependent cellular 
cytotoxicity. Preferred isotypes are IgGl, IgG3 and IgG4. Either of the human light chain 
constant regions, kappa or lambda, may be used. The chimeric, humanized antibody is then 
expressed by conventional methods. 

Antibody fragments, such as Fv, F(ab) 2 and Fab may be prepared by cleavage of the 
intact protein, e.g. by protease or chemical cleavage. Alternatively, a truncated gene is 
designed. For example, a chimeric gene encoding a portion of the F(ab) 2 fragment would 
include DNA sequences encoding the CHI domain and hinge region of the H chain, followed 
by a translation^ stop zodon to yield the truncated molecule. 

Consensus sequences of H and L J regions may be used to design oligonucleotides for 
use as primers to introduce useful restriction sites into the J region for subsequent linkage of 
V region segments to human C region segments. C region cDNA can be modified by site 
directed mutagenesis to place a restriction site at the analogous position in the human 
sequence. 

Expression vectors include plasmids, retroviruses, YACs, EBV derived episomes, and 
the like. A convenient vector is one that encodes a functionally complete human CH or CL 
immunoglobulin sequence, with appropriate restriction sites engineered so that any VH or VL 
sequence can be easily inserted and expressed. In such vectors, splicing usually occurs 
between the splice donor site in the inserted J region and the splice acceptor site preceding 
the human C region, and also at the splice regions that occur within the human CH exons. 
Polyadenylation and transcription termination occur at native chromosomal sites downstream 
of the coding regions. The resulting chimeric antibody may be joined to any strong promoter, 
including retroviral LTRs, e.g. SV-40 early promoter, (Okayama et aJ. (1983) Mol Cell Bin 
3:280), Rous sarcoma virus LTR (Gorman et ai (1982) P.N A S 79:6777), and moloney 
murine leukemia virus LTR (Grosschedl et ai (1985) £eJl 41:885); native Ig promoters, etc. 

In yet other embodiments of the invention, the active agent is an agent that modulates, 
and generally decreases or down regulates, the expression of the gene in the host. Antisense 
molecules can be used to down-regulate expression of the protein in cells. The anti-sense 
reagent may be antisense oligonucleotides (ODN), particularly synthetic ODN having 
chemical modifications from native nucleic acids, or nucleic acid constructs that express such 
anti-sense molecules as RNA. The antisense sequence is complementary to the mRNA of the 
targeted gene, and inhibits expression of the targeted gene products. Antisense molecules 
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inhibit gene expression through various mechanisms, e.g. by reducing the amount ofmRNA 
available for translation, through activation of RNAse H, or steric hindrance. One or a 
combination of antisense molecules may be administered, where a combination may comprise 
multiple different sequences. 

Antisense molecules may be produced by expression of all or a part of the target gene 
sequence in an appropriate vector, where the transcriptional initiation is oriented such that an 
antisense strand is produced as an RNA molecule. Alternatively, the antisense molecule is a 
synthetic oligonucleotide. Antisense oligonucleotides will generally be at least about 7, 
usually at least about 12, more usually at least about :,0 nucleotides in length, and not more 
than about 500, usually not more than about 50, more usually not more than about 35 
nucleotides in length, where the length is governed by efficiency of inhibition, specificity, 
including absence of cross-reactivity, and the like. It has been found that short 
oligonucleotides, of from 7 to 8 bases in length, can be strong and selective inhibitors of gene 
expression (see Wagner et al (1996), Nature Biotechnol 14:840-844). 

A specific region or regions of the endogenous sense strand mRNA sequence is 
chosen to be complemented by the antisense sequence. Selection of a specific sequence for 
the oligonucleotide may use an empirical method, where several candidate sequences are 
assayed for inhibition of expression of the target gene in an in vitro or animal model. A 
combination of sequences may also be used, where several regions of the mRNA sequence 
are selected for antisense complementation. 

Antisense oligonucleotides may be chemically synthesized by methods known in the 
art (see Wagner et al. (1993), supra, and Miliigan et al, supra) Preferred oligonucleotides 
are chemically modified from the native phosphodiester structure, in order to increase their 
intracellular stability and binding affinity. A number of such modifications have been 
described in the literature, which alter the chemistry of the backbone, sugars or heterocyclic 



Among useful changes in the backbone chemistry are phosphorothioates; 
phosphorodithioates, where both of the non-bridging oxygens are substituted with sulfur; 
phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral phosphate 
derivatives include 3'-0'-5'-S-phosphorothioate, 3'-S-5'-0-phosphorothioate, 3'-CH 2 -5'-0- 
phosphonate and 3 '-NH-5 '-O-phosphoroamidate. Peptide nucleic acids replace the entire 
ribose phosphodiester backbone with a peptide linkage. Sugar modifications are also used to 
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enhance stability and affinity. The a-anoraer of deoxyribose may be used, where the base is 
inverted with respect to the natural P-anomer. The 2'-OH of the ribose sugar may be altered 
to form 2'-0-methyi or 2'-0-allyl sugars, which provides resistance to degradation without 
comprising affinity. Modification of the heterocyclic bases must maintain proper base pairing. 
Some useful substitutions include deoxyuridine for deoxythymidine; 5-methyl-2'- 
deoxycytidine and 5-bromo-2'-deoxycytidine for deoxycytidine. 5- propynyl-2 '-deoxyuridine 
and 5-propynyl-2'-deoxycytidine have been shown to increase affinity and biological activity 
when substituted for deoxythymidine and deoxycytidine, respectively. 

As an alternative to anti-sense inhibitors, catalytic nucleic acid compounds, 
ribozymes, anti-sense conjugates, etc. may be used to inhibit gene expression. Ribozymes 
may be synthesized in vitro and administered to the patient, or may be encoded on an 
expression vector, from which the ribozyme is synthesized in the targeted cell (for example, 
see International patent application WO 9523225, and Beigelman et al (1995), Nucl. Acids 
Res. 23:4434-42). Examples of oligonucleotides with catalytic activity are described in WO 
9506764. Conjugates of anti-sense ODN with a metal complex, e.g. terpyridylCu(II), capable 
of mediating mRNA hydrolysis are described in Bashkin et al (1995), Appl. Biochem. 
Biotechnoi 54:43-56. 

As mentioned above, an effective amount of the active agent is administered to the 
host, where "effective amount" means.a dosage sufficient to produce a desired result, where 
the desired result in the desired modulation, e.g. enhancement, reduction, of PDE interacting 
protein activity, which in turn leads to a desired effect on the state of the disease condition 
being treated, e.g. a reduction in the level of infl ammation etc. 

In the subject methods, the active agent(s) may be administered to the host using any 
convenient means capable of resulting in the desired inhibition of PDE interacting protein 
activity. Thus, the agent can be incorporated into a variety of formulations for therapeutic 
aarninistnuion. More particularly, the agents of the present invention can be formulated into 
pharmaceutical compositions by combination with appropriate, phannaceutically acceptable 
carriers or diluents, and may be formulated into preparations in solid, semi-solid, liquid or 
gaseous forms, such as tablets, capsules, powders, granules, ointments, solutions, 
suppositories, injections, inhalants and aerosols. 
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As such, administration of the agents can be achieved in various ways, including oral, 
buccal, rectal, parenteral, intraperitoneal, intradermal, transdermal, intracheal,etc., 
administration. 

In pharmaceutical dosage forms, the agents may be administered in the form of their 
pharmaceutically acceptable salts, or they may also be used alone or in appropriate 
association, as well as in combination, with other pharmaceuticaUy active compounds. The 
following methods and excipients are merely exemplary and are in no way limiting. 

For oral preparations, the agents can be used alone or in combination with appropriate 
additives to make tablets, powders, granules or capsules, for example, with conventional 
additives, such as lactose, mannitol, com starch or potato starch; with binders, such as 
crystalline cellulose, cellulose derivatives, acacia, com starch or gelatins; with disintegrators, 
such as corn starch, potato starch or sodium carboxymethylcellulose; with lubricants, such as 
talc or magnesium stearate; and if desired, with diluents, buffering agents, moistening agents, 
preservatives and flavoring agents. 

The agents can be formulated into preparations for injection by dissolving, suspending 
or emulsifying them in an aqueous or nonaqueous solvent, such as vegetable or other similar 
oils, synthetic aliphatic acid glycerides, esters of higher aliphatic acids or propylene glycol; 
and if desired, with conventional additives such as solubilizers, isotonic agents, suspending 
agents, emulsifying agents, stabilizers and preservatives. 

The agents can be utilized in aerosol formulation to be administered via inhalatioa 
The compounds of the present invention can be formulated into pressurized acceptable 
propellants such as dichlorodifluoromethane, propane, nitrogen and the like. 

Furthermore, the agents can be made into suppositories by rnixing with a variety of 
bases such as emulsifying bases or water-soluble bases. The compounds of the present 
invention can be adniinistered rectaliy via a suppository. The suppository can include vehicles 
such as cocoa butter, carbowaxes and polyethylene glycols, which melt at body temperature, 
yet are solidified at room temperature. 

Unit dosage forms for oral or rectal administration such as syrups, elixirs, and 
suspensions may be provided wherein each dosage unit, for example, teaspoonful, 
tablespoonful, tablet or suppository, contains a preueternuned amount of the composition 
containing one or more inhibitors. Similarly, unit dosage forms for injection or intravenous 
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administration may comprise the inhibitors) in a composition as a solution in sterile water, 
normal saline or another phannaceutically acceptable carrier. 

The term "unit dosage form," as used herein, refers to physically discrete units suitable 
as unitary dosages for human and animal subjects, each unit containing a predetermined 
quantity of compounds of the present invention calculated in an amount sufficient to produce 
the desired effect in association with a pharmaceutical^ acceptable diluent, carrier or vehicle. 
The specifications for the novel unit dosage forms of the present invention depend on the 
particular compound employed and the effect to be achieved, and the pharmacodynamics 
associated with each compound in the host. 

The phannaceutically acceptable excipients, such as vehicles, adjuvants, carriers or 
diluents, are readily available to the public. Moreover, pharmaceutical^ acceptable auxiliary 
substances, such as pH adjusting and buffering agents, tonicity adjusting agents, stabilizers, 
wetting agents and the like, are readily available to the public. 

Where the agent is a polypeptide, polynucleotide, analog or mimetic thereof, e.g. 
antisense composition, it may be introduced into tissues or host cells by any number of routes, 
including viral infection, microinjection, or fusion of vesicles. Jet injection may also be used 
for intramuscular administration, as described by Furth et al (1 992), Anal Biochem 205:365- 
368. The DNA may be coated onto gold microparticles, and delivered intradermally by a 
particle bombardment device, or "gene gun" as described in the literature (see, for example, 
Tang et al (1992), Nature 356:152-154), where gold microprojectiles are coated with the 
DNA, then bombarded into skin cells. 

Those of skill in the art will readily appreciate that dose levels can vary as a function 
of the specific compound, the severity of the symptoms and the susceptibility of the subject to 
side effects. Prefened dosages for a given compound are readily determinable by those of skill 
in the art by a variety of means. 

The subject methods find use in the treatment of a variety of different disease 
conditions involving PDE interacting protein activity, particularly in those disease conditions 
in which the selective inhibition of PDE activity, more particularly PDEIV activity, results in 
treatment of the disease condition where targeting of the PDE interacting protein by the 
therapeutic agent results in modulated, e.g. reduced or enhanced, activity of its corresponding 
PDE. 
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Specific disease of interest as treatable by the subject methods include: asthma, 
including inflamed lung associate asthma, cystic fibrosis, inflammat ory airway disease, chronic 
bronchitis, eosinophilic granuloma, psoriasis and other benign and malignant proliferative skin 
diseases, endotoxic shock, septic shock, ulcerative colitis, Crohn's disease, reperfusion injury, 
or the myocardium and brain, inflammatory arthritis, chronic gloerulonephritis, atopic 
dermatitis, urticaria, adult respiratory distress syndrome, diabetes insipidus, allergic rhinitis, 
allergic conjunctivitis, vernal conjunctivitis, arterial restenosis and artherosclerosis, 
infl a mm atory diseases associated with irritation and pain, rheumatoid arthritis, ankylosing 
spondylitis, transplant rejection and graft versus host disease, disease conditions associates 
with hypersecretion of gastric acid, disease conditions in which cytokines are mediators, e.g. 
sepsis, and septic shock, and the like. 

By treatment is meant at least an amelioration of the symptoms associated with the 
pathological condition afflicting the host, where amelioration is used in a broad sense to refer 
to at least a reduction in the magnitude of a parameter, e.g. symptom, associated with the 
pathological condition being treated, such as inflammation, etc. As such, treatment also 
includes situations where the pathological condition, or at least symptoms associated 
therewith, are completely inhibited, e.g. prevented from happening, or stopped, e.g. 
terminated, such that the host no longer suffers from the pathological condition, or at least 
the symptoms that characterize the pathological condition. 

A variety of hosts are treatable according to the subject methods. Generally such hosts 
are "mammals" or ''mammalian," where these terms are used broadly to describe organisms 
which are within the class mammalia, including the orders carnivore (e.g., dogs and cats), 
rodentia {e.g., mice, guinea pigs, and rats), and primates (e.g., humans, chimpanzees, and 
monkeys). In many embodiments, the hosts will be humans. 

Kits with unit doses of the active agent, usually in oral or injectable doses, are 
provided. In such kits, in addition to the containers containing the unit doses will be an 
informational package insert describing the use and attendant benefits of the drugs in treating 
pathological condition of interest. Preferred compounds and unit doses are those described 
herein above. 

The following examples are offered primarily for purposes of illustration. It will be 
readily apparent to those skilled in the art that the formulations, dosages, methods of 
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administration, and other parameters of this invention may be further modified or substituted 
in various ways without departing from the spirit and scope of the invention. 

EXPERIMENTAL 
I. Screening of the yeast two hybrid system cDNA brain library 

To identify proteins that interact with a PDE4, cDNA coding for the amino terminus 
of PDE4D3 or for a region corresponding to a.a. 1 14-672 were inserted into pGBT9 vectors 
and used for screening of a Matchmaker rat brain library subcloned in pGAD 1 0 vector 
(Clontech, Palo Alto, CA). The fragment encoding the autoinhibitory (UCR2), catalytic, and 
carboxy tenninal domains of rPDE4D3 (aa 1 14-672) was amplified by PCR with the full- 
length cDNA using the following forward and reverse primers with incorporated restriction 
sites and Stop codoa EcoRI: 5' CGG AAT TCG AGG AGG CCT ACC AGA AAC 3' 
(GUPA4) (SEQ ID NO:06) and Sall/TAG: 5' TGA GTC GAC TAC GTG TCA AGG CAA 
CAA TGG TC 3' (GUPA3) (SEQ ID NO:07). The PCR products were cloned into 
EcoRI/SaU site of pGBT9 (Clontech) downstream of the Gal4 activation domain. The PCR 
was performed in presence of recombinant Pfu polymerase (Stratagene) at low cycle number 
(10 cycles) to ensure high fidelity reading. The insertions were entirely sequenced to confirm 
the correct reading frame and the sequence. Sequencing was performed by the Molecular 
Biology facility at Stanford University using the ABI PRISM Dye Teraiinator Cycle 
Sequencing Ready Reaction Kit with AmpIiTaq DNA Polymerase, FS (Perkin Elmer). 

Of the positive clones isolated from the screening of the rat brain library, 187 gave 
strong positive signal while 8 1 gave only a weak signal. Of the strong positive clones, PBP46 
was further characterized. This clone contained an insert of aprroximately 2.8 kb. The 
interaction of the clone with the PDE was confirmed by subcloning the cDNA fragment in 
both pGBT9 and pGADIO and by testing growth and 0-galactosidase activity in the yeast two 
hybrid system. The clone continued to show strong interaction with the 1 .6 fragment of 
PDE4D3. 



II. Screening for the full length myomegalin clone 

A homology search (BLAST) using the sequence of PBP46 clone showed no 
significant identity to sequences in any public domain database. This clone was then used to 
probe a blot with RNA from multiple tissues. A transcript of approximately 8.0-8.5 kb 
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hybridized to the probe in several tissues, the highest ievei of expression being observed in the 
rat skeletal muscle and heart. Lower levels of expression were detected in brain, liver and 
lung. In the testis a transcript of 2.0-2.4 kb was consistently observed. The expression in the 
testis was confirmed by PCR and by screening a rat testis library. Two clones containing the 
3' end sequence of myomegalin were retrieved from this library. 

To obtain the complete sequence of the 8.0-8.5 transcript, a rat skeletal muscle cDNA 
library was screened with the PBP46 cDNA. From this screening, 2 clones were retrieved. 
However, the clones did not yield a complete ORF. Screening was then repeated six more 
times with oligonucleotides corresponding to the 5' end of the longest clones. From this 
multiple screening, 21 overlapping clones were obtained. Merging of the sequences from the 
different clones yielded a 9 kb sequence, a size in agreement with the size of the transcript 
derived from rat heart and skeletal muscles. See Fig. 2. Conceptual translation of the 
nucleotide sequence uncovered an open reading frame of a protein of 2324 amino acids 
corresponding to a calculated MW of 261 kDa. See Fig. 1. 

To analyze tissue distribution of the rat myomegalin transcripts, Northern blot analysis 
was performed using radioactively labeled probes corresponding to the 3 1 end (probe 1; 1000 
bp) and the 5' end (probe 2; 665 bp) of the myomegalin open reading frame. Transcripts of 
various sizes were found in various tissues using either probe 1 or probe 2 or both. The 
results indicated that there are at least four different transcripts of rat myomegalin: two 
expressed in heart (7.5 and 5.9 kb); two in skeletal muscle (7.5 and 4.3 kb) and one in testis 
(2.5 kb). The 2.5 kb variant roughly corresponds to the PBP46 clone, and is expressed 
exclusively in rat testis. 



ID. Screening of the EST/database 

To determine whether mouse or human sequences analogous to the rat myomegalin 
are present in public domain databases, the rat sequence was used for a BLAST search of 
GenBank and EST libraries. The following EST were retrieved. AA755885, AA 11 0441, 
W23471, AA333456, AA489265. These sequences are more than 90% homologous to the r 
sequence. Sequence AL021920 contains a genomic fragment from human chromosome 
lp35. l-p36.21. Several exons overlap with the rat sequence from residue 1215 until residue 
1444. Thus myomegalin must reside on human chromosome lp35. l-p36. KIAA0454 
(accession # AB007923), KIAA0477 (acccesion # AB007946) are two clones containing 
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portion of the human myomegalin sequence since they are more than 90% homologous to the 
rat ORF. These human clones were merged to obtain a full length human sequence 
homologous to myomegalin. See Fig. 4. The human open reading frame coded for a protein 
of 25 17 residues and a calculated molecular weight of 282. 1 kDa. See Fig. 5. 

Alignment of the human and rat sequence showed identity from aa 235 of rat 
myomegalin to the end. In the amino terminus region, the two sequences showed only weak 
homologies. The reason for this discrepancy is at present unclear. It is possible that it is due 
to species differences. The junction where the rat sequence diverges from the human was 
derived from four clones isolated from the rat skeletal muscle library, lessening the possibility 
that cloning artifact is at the basis of this discrepancy. The presence of the junction was 
further confirmed by PCR analysis of rat heart mRNA (data not shown). However, further 
blast searches with the region encompassing the 5' end of myomegalin did not yield mouse 
EST fragments overlapping the junction. Conversely, several EST clones confirming the 
human junction were retrieved from human and mouse EST databases. 



IV Protein/protein interaction 

Several attempts were made to confirm the interaction between myomegalin and 
PDE4D3. However, due to the insolubility of the full length or truncated myomegalin 
irnmunoprecipitation experiments could not be performed. In an alternative approach, PBP46 
was cotransfected with PDE4D3 in COS 7 cells and the PDE activity was determined in the 
particulate fraction of the cell. If PDE4D3 interacts with PBP46, an increase in the particulate 
PDE activity would be expected. Two to three fold increase in the particulate PDE4D3 
activity was detected when plasmids containing PBP46 and PDE4D3 were cotransfected in 
COS7 cells. 

V Subcellular localization of myomegalin 

To investigate the subcellular localization of myomegalin the PBP46 clone was 
subcloned in frame to a flag tag and expressed in COS7 cells. The recombinant protein thus 
obtained was entirely recovered in the particulatefraction and could be extracted only with 
buffer containing SDS. Expression in transfected cells was further assessed by 
immunofluorescence (IF) using the flag antibody. The flag tagged recombinant protein 
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encoded in PBP46 was entirely localized in the Golgi/centrosomal region of COS7 cells. No 
attempts were made to express the full-length myomegalin cDNA. 

VI. Western blot analysis of muscle and testis extracts 

Polyclonal antibodies were raised in rabbit against peptides corresponding to the 
carboxyl tenninus region of myomegalin. These antibodies recognize in testis a protein of 
approximately 64 kDa. In heart and muscle, proteins of 280,250 and 200 kDa were observed. 
It is at present unknown whether these are native proteins or products of proteolysis. When 
these antibodies were used for IF localization a region corresponding to the 
Golgi/centrosomal region is intensely labeled. 



It is apparent from the above results and discussion that polynucleotides encoding 
novel mamm alia n PDE interacting proteins, such as myomegalin, as well as the novel 
polypeptides encoded thereby, are provided. The subject invention is important for both 
1 5 research and therapeutic applications. For example, identification of the subject PDE 

interacting proteins provides for the ability to screen potential PDE inhibitors with PDE/PDE 
interacting protein complexes, where the results of such screening procedures should be more 
indicative of in vivo activity of a potential agent than screening procedures in which PDE is 
used by itself. Accordingly, the subject invention provides for a significant contribution to the 
20 art. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. The citation of any 
publication is for its disclosure prior to the filing date and should not be construed as an 
25 admission that the present invention is not entitled to antedate such publication by virtue of 
prior invention 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it is readily apparent to 
30 those of ordinary skill in the art in light of the teachings of this invention that certain changes 
and modifications may be made thereto without departing from the spirit or scope of the 
appended claims. 
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WHAT IS HLAIMFn Tfr 

1 . A polynucleotide present in other than its natural environment encoding a PDE 
interacting polypeptide. 



2. The polynucleotide according to Claim 1 , wherein said polynucleotide encodes 
a myomegalin protein. 

3 . A fragment of a polynucleotide according to Claim 1 . 

4. An PDE interacting polypeptide present in other than its naturally occurring 
environment. 

5. The polypeptide according to Claim 4, wherein said polypeptide is a 
myomegalin protein. 



6. A fragment of a polypeptide according to Claim 4. 

7. Substantially pure PDE interacting protein. 

8 . Isolated PDE interacting protein. 



9. An expression cassette comprising a transcriptional initiation region functional 
in an expression host, a polynucleotide having a nucleotide sequence found in the nucleic acid 
according to Claim I under the transcriptional regulation of said transcriptional initiation 
region, and a transcriptional termination region functional in said expression host. 

10. A cell comprising an expression cassette according to Claim 9 as part of an 
extrachromosomal element or integrated into the genome of a host cell as a result of 
introduction of said expression cassette into said host cell. 



The cellular progeny of the cell according to Claim 10. 
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12. A method of producing an PDE interacting polypeptide, said method 
comprising: 

growing a cell according to Claim 10, whereby said polypeptide is expressed; and 
isolating said polypeptide substantially free of other proteins. 

13. A monoclonal antibody binding specifically to a PDE interacting protein. 

14. The monoclonal antibody according to Claim 13, wherein said antibody 
inhibits the activity of at least one of PDE or a PDE interacting protein. 

15. The monoclonal antibody according to Claim 13, wherein said antibody is a 
humanized antibody. 

16. A method of determining whether an agent modulates the activity of a PDE, 
said method comprising: 

contacting a complex of said PDE and a PDE interacting protein with said agent; and 
determining the effect of said agent on the activity of said PDE. 

17. The method according to Claim 1 6, wherein said agent is a small molecule. 

1 8. The method according to Claim 1 6, wherein said agent is an antibody. 

19. The method according to Claim 1 8, wherein said agent is a monoclonal 
antibody. 

20. A method for modulating the activity of a PDE interacting protein, said 
method comprising: 

contacting said PDE interacting protein with an agent that modulates the activity of 
said PDE interacting protein. 
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FIG. 1 



>myoraegalin protein 

MSNGYRTLSQHLNDLKKENFSLKLRIYFLEEEIMQQECYEVSREDVYKRNIELKVEVESLKRELQDRKQHL 

HKTWADEEDLNSQNEAELRRQVEEPQQETEHVYELLDNNIQLLQEESRFAKDEATQMETLVEAEKGCNL 

ELSERWKDATKNREDAPGDQVKLDQYSAALAQRDRRIEELRQSLAAQEGLVEQLSREKQQLLHLLEEPG 

GMEVQPMPKGLPTQQKPDLNETPTTQPSVSDSHLAELQDKIQQTEVTNKILQEKLNDMSCELRSAQESS 

QKQDTTIQSLKEMLKSRESETEELYQVIEGQNDTMAKLPEMLHQSQLGQLQSSEGIAPAQQQVALLDLQ 

SAL FCS QLE I QKLQ RLLRQKERQLADG KRCMQ FVE AAAQE REQQ KEAAWKHNQ E LRKALQH LQGELHS K 

SQQLHVLEAEKYNEIRTQGQNIQHLSHSLSHKEQLIQELQELLQYRDTTDKTLDTNEVFLEKLRQRIQD 

RAVALERVIDEKFSALEEKDKELRQLRLAVRDRDHDLERLRCVLSANEATMQSMESLLRARGLEVEQLI 

ATCQNLQWLKEELETKFGHWQKEQESIIQQLQTSLHDRNKEVEDLSATLLHKLGPGQSEVAEELCQRLQ 

RKERVLQDLLS DRNKQAMEHEME VQGLLQSMGT REQERQAVAEKMVQAFMERNS ELQALRQ YLGGKELM 

AASQAFISNQPAGATSVGPHHGEQTDQGSTQMPSRDDSTSLTAREEASIPRSTLGDSDTVAGLEKELSN 

AKEELELMAKKERESQIELSALQSMMAVQEEELQVQAADLESLTRNIQIKEDLIKDLQMQLVDPEDMPA 

MERLTQEVLLLREKVASVEPQGQEGSENRRQQLLLMLEGLVDERSRLNEALQAERQLYSSLVKFHAQPE 

ISERDRTLQVELEGAQVLRSRLEEVLGRSLERLSRLETLAAIGGATAGDETEDTSTQFTDSIEEEAAHN 

SHQQLIKVSLEKSLTTMETQNTCLQPPSPVGEDGNRHLQEEMLHLRAEIHQPLEEKRKAEAELKELKAQ 

IEEAG FS SVSHI RNTMLSLCLCLENAELKEQMGEAMS DGWEVEEDKEKGEVMVETWAKGGLSEDSLQA 

EFRKVQGRLKSAYNIINLLKEQLVLRSSEGNTKEMPEFLVRLAREVDRMNMGLPSSEKHQHQEQENMTA 

RPGPRPQSLKLGTALSVDGYQLENKSQAQDSGHQPEFSLPGSTKHLRSQLAQCRQRYQDLQEKLLISEA 

TVFAQANQLEKYRAILSESLVKQDSKQIQVDLQDLGYETCGRSENEAEREETTSPECEEHGNLKPWLV 

EGLCSEQGYLDPVLVSSPVKNPWRTSQEARRIQAQGTSDNSSLLRKDIRNLKAQLPNAYKVLQNLRSRV 

RSLSATSDYSSSLERPRKLIAVATLEGASPHSVTDEDEGLLSDGTGAFYPPGLQAKKNLENLIQRVSQL 

EAQLPKTGLEGKLAEELKSASWPGKYDSLIQDQARKTVISASENTKREKDLFSSHPTFERYVKSFEDLL 

RNNDLTTYLGQSFREQLSSRRSVTDRLTSKFSTKDHKSEKEEVGLEPLAFRFSRELQEKEKVIEVLQAK 

VDTRFFSPPSSHAASESHRCASSTSFLSDDIEACSOMDVASEYTHYEEKKPSPSNSAASASQGLKGEPR 

SSSISLPTPQNPPKEASQAQPGFHFNSIPKPASLSQAPMHFTVPSFMPFGPSGPPLLGCCETPWSLAE 

AQQELQMLQKQLGRSVSIAPPTSTSTLLSNHTEASSPRYSNPAQPHSPARGTIELGRILEPGYLGSGQW 

DMMRPQKGS I S GELS SGSSMYQLNS KPTGADLLEEHLGEI RNLRQRLEES I CVN DRLREQLQHRLS ST A 

RENGSTSHFYSQGLESMPQLYNENRALREENQSLQTRLSHASRGHSQEVDHLREALLSSSSQLQELEKE 

LEQQKAERRQLLEDLQEKQDEIVHFREERLSLQENNSRLQHKLALLQQQCEEKQQLSLSLQSELQIYES 

LYENPKKGLKAFSLDSCYQVPGELSCLVAEIRALRVQLEQSIQVNNRLRLQLEQQMDHGAGKASLSSCP 

VNQSFSAKAELANQQPPFQGSAASPPVRDVGLNSPPWLPSNSCSVPGSDSAIISRTNNGSDESAATKT 

PPKMEVDAADGPFASGHGRHVIGHVDDYDALQQQIGEGKLLIQKILSLTRPARSVPALDAQGTEAPGTK 

SVHELRSSARALNHSLEESASLLTMFWRAALPNSHGSVLVGEEGNLMEKELLDLRAQVSQQQQLLQSTA 

VRLKTANQRKKSMEQFIVSHLTRTHDVLKKARTNLEMKSFRALMCTPAL (SEQ ID NO: 01) 
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CCGGTCCCCTTTGGTAGTAGTATCTCAGAGCTCGCCCCATAGTTTCATAGTTCATGTCTGGTTTGTTCT 

TATGCTTTCCCCAGAGCTTCGAGACAGCCTTTGAGTCCACCAGCTTGAATATGCCCTTTTCTCTCTGAG 

TCCAT T T AATAT AC CTGGGACAAGTATTT TT ATC TTGAAGCAGAT CTAAAAGAAACTCCCACAGATAGG 

TTGTGTTTCCTTTTCCTTCTCTGGCTTTCTTCTTGACTCCTAACTCAGGAGACCCATTGGAAACTGGTG 

ACTGCTGGGTCTTTGGTTTACGGCCAACTTTCTTCTTTTTCATTGGTTCGTGGCTGTCTGGTGAAGTAT 

GGATAGGCGAGGCATCCATTGGTTCAGACTCCTC7GTTGACACCTCCACTACAGTCTCCGTAATGACAT 

CTGGCCTCATCGCAGCATGGATAAAATCGGATTCTTGAATCCTCAAGCAGGTAGGAGACTCCATATGAA 

GCAGGGCTTCAGCAGCTTCAATGGTCTTATCCGTACAGTGTGCATTACTGCTGTGAACTGATGCTTCCA' 

CGGCCCGGATGAGCAGATCCAGCTGGTTCGTGGGTCCCTCATGCAGAGACGTCGCCATGTTTATCCCrr 

GGCTGGAACTGCTGCTTCACATTGACTTACACCCTGAGCAGCGGCGACAGGGGAGAAGGCGGAACCCGC 

GGCCGGAGACACACGCCGTGCGGGCGGCACACACTCACGCACTCGCACACACTCCGACGCCCGGATCrT 

TGCGCGTCCTCCGACAGGAAGCGGCGGCCGGCCCGCCTCCCGCCGCCGGGCTGAGCAGCCCCACCACCT 

AA.CGGCAGGGGCGGCGGCGGCCCCGGCTGGCAACGCGATCCTTCCGCCCCGCGCCCAGACAGGAAGTCC 

CGGGCGCCGGCAGCCAGCGGCCGCACGGACACCTGAGGCTGGGGAGCCCGCAGGCCGCCCTCGGGGACG 

CGGGCCTCGGCAGGAAAAGGCGCGCTTCACGTTCTGCGGAAGCGAAGTCTGCAAATGTCCCCTCAGCAT 

GGTCTTCCTCCTGGCTCAATCTGTCTCACCTTCAGGTGATCCTAGGACTGGGGCTCCTTTCCAGGTCCC 

CAGTTTCTCAAGTCGATCTTCTACCTCCCTCTTGATTTTCTACTCCATTGCTGGAAAGCTCCAGAACAG 

AGCCTCCGCCGCCAACCACTGCTGATGCCATCGCGTCTTCCCTGAGCAAGTTTCGAACGCTGCGAATCA 

ATGTAATTACGGCTCAGATGATTGCCAGGGTTATCGGTTTCATGTTCTAATTCAATAGTGATGGAGTAG 

ACATCCAGAAGTCCAGTCTTCTAAAGATGATTAACCAGAGGGTAGTTTGACGGTTAAGTAGTCTAAGCA 

TCCTTCACCGTTTCCACACTCCCAAGAGCTGAACTCTAAACCAGCAGCTCTCTGGAGCTACTGCTCTCC 

CTCCACGTCGCCGTGTCCCTTGCCCTTCCCCTCAGGGCCGCAGACCGGCCGAGCCGCCGCAGCCGCCCG 

CCGTTGGCCCGGCGTCCTGCGGGAAGCCGAGGGGGCCTCCCCGGGCCACCGCGCGAGCCGCTCCGCACC 

ACAGGACGAGACAAACCGCGGCTATGTCGCCTTAGCCCTCGGGGTCCCACAGCCTCAGCAGCGTCCTAG 

CCTGCCCGCTCCATGCCACGGCAAGGCTGCACCGTGTTCCAGGGGTGAAGGGGGCGATCGGGCATGCTC 

CTCCCCATGGGTCGCCCACCATGTCTAATGGATATCGCACTCTGTCCCAGCACCTCAATGACCTGAAGA 

AGGAGAACTTCAGCCTCAAGCTGCGCATCTACTTCCTGGAGGAGCGCATGCAACAGAAGTATGAAGTCA 

GCCGGGAGGACGTCTACAAGCGGAACATTGAGCTGAAGGTTGAAGTGGAGAGCCTGAAACGAGAGCTCC 

AGGACAGGAAACAGCATCTACATAAAACATGGGCCGATGAGGAGGATCTCAACAGCCAGAATGAAGCAG 

AGCTCCGGCGCCAGGTTGAAGAACCGCAGCAGGAGACAGAACACGTTTATGAGCTCCTAGACAACAACA 

T T^GCTGCTGCAGGAGGAATCCAGGTTT GCAAAGGATGAAGCCACACAGATGGAGACTCTGGTGGAGG 

CAGAGAAGGGGTGTAATCTGGAGCTCTCAGAGAGGTGGAAGGATGCTACCAAGAACAGGGAAGATGCAC 

CGGGAGACCAGGTGAAGCTTGACCAATATTCTGCGGCACTGGCTCAGAGGGACAGGAGAATTGAAGAGC 

TGAGGCAGAGCTTGGCTGCCCAGGAGGGGCTTGTGGAACAGCTGTCTCGAGAGAAACAACAACTGTTAC 

ATCTGCTGGAGGAGCCTGGGGGCATGGAAGTGCAGCCCATGCCTAAAGGGTTACCCACGCAACAAAAGC 

CAGACCTAAATGAGACCCCTACAACCCAGCCATCTGTGTCTGATTCCCACCTGGCAGAACTCCAGGACA 

AAATCCAGCAAACAGAGGTCACCAACAAGATTCTTCAAGAGAAACTGAATGACATGAGCTGTGAGCTCA 

GATCTGCACAGGAGTCGTCTCAGAAGCAAGATACGACAATCCAAAGCCTCAAGGAAATGCTAAAGAGCA 

GGGAAAGTGAGACTGAAGAGCTGTACCAGGTGATTGAAGGTCAAAATGACACAATGGCAAAGCTTCCGG 

AAATGCTACACCAGAGCCAGCTCGGACAGCTCCAGAGCTCAGAGGGCATTGCCCCTGCTCAGCAGCAAG 

TGGCCCTGCTTGACCTTCAGAGTGCTCTGTTCTGCAGCCAGCTTGAAATCCAGAAGCTCCAGAGGCTGT ■ 

TACGCCAGAAAGAGCGTCAGCTGGCTGACGGCAAGCGGTGCATGCAATTTGTGGAGGCTGCAGCACAGG 

AGAGAGAGCAGCAGAAGGAAGCTGCTTGGAAACATAACCAGGAATTACGAAAAGCTTTGCAACACCTCC 

AAGGAGAACTGCACAGTAAGAGCCAACAGCTCCACGTTCTGGAGGCAGAAAAATATAATGAAATTCGAA 

C CCAGGGAC AAAACATT CAACACCT AAG T CACAGT CT GAG TCACAAAGAG CAGCTAAT TCAGGAACTTC 

AGGAGCTCCTACAGTATCGGGATACCACAGACAAAACTCTAGACACAAATGAGGTGTTTCTTGAGAAAC 
TACGGCAACGAATACAAGACCGGGCAGTTGCTCTAGAGCGGGTTATAGATGAAAAGTTCTCTGCTCTAG 
AAGAAAAGGACAAGGAACTGCGGCAGCTCCGGCTTGCTGTGAGGGACCGAGACCATGACTTAGAGAGAC 
TGCGTTGTGTCCTGTCTGCCAATGAAGCTACCATGCAAAGTATGGAGAGTCTCCTGAGGGCCAGAGGCC 
TGGAAGTGGAGCAGTTAATTGCCACCTGCCAAAACCTCCAGTGGTTGAAGGAAGAATTGGAAACCAAGT 
TTGGCCACTGGCAGAAGGAACAGGAGAGCATCATTCAGCAGTTACAGACATCTCTGCATGACAGGAACA 
AAGAAGTAGAGGATCTCAGTGCAACTTTGCTCCACAAACTTGGACCCGGCCAGAGTGAAGTAGCTGAGG 
AGCTGTGCCAGCGCCTGCAGCGGAAGGAAAGGGTGCTGCAGGACCTTCTGAGTGATCGGAACAAACAAG 
CCATGGAGCACGAGATGGAGGTCCAGGGACTGCTCCAGTCGATGGGCACCCGGGAACAGGAAAGACAGG 
CTGTTGCAGAAAAAATGGTACAAGCCTTCATGGAAAGAAACTCGGAATTACAGGCCCTGCGGCAGTATC 
TAGGGGGGAAGGAATTAATGGCAGCATCTCAGGCATTCATCTCTAACCAACCAGCTGGAGCGACTTCTG 
TAGGCCCCCACCATGGAGAGCAAACTGACCAAGGTTCTACGCAGATGCCCTCTCGAGACGACAGCACCT 
CGCTGACT GC CAGAGAGGAGGCCAGCATACCCCGGTCTAC ATT AG GAGACTCAGACACAGT TGCAGGGC 
TGGAGAAAGAACTGAGCAATGCCAAGGAGGAGCTTGAGCTCATGGCCAAAAAAGAAAGAGAAAGCCAGA 
TAGAATTGTCTGCCCTGCAGTCCATGATGGCTGTGCAAGAGGAAGAGCTGCAGGTGCAGGCTG^TGAC? 
TGGAGTCCCTCACCAGGAACATACAGATAAAAGAAGACCTCATAAAGGACCTGCAAATGCAACTGGTTG 



WO 00/27861 PCT/US99/26860 



3/12 
FIG. 2 (CONT) 

ACCCTGAAGATATGCCAGCCATGGAGCGCCTGACCCAAGAGGTCTTACTTCTTCGGGAAAAAGTTGCTT 

CAGTGGAACCCCAGGGTCAGGAAGGGTCAGAGAACAGGAGACAACAGT7GCTGCTGATGTTAGAAGGAC 

TAGTGGATGAACGGAGTCGGCTCAACGAGGCCCTGCAAGCTGAGCGGCAGCTCTACAGCAGCCTGGTCA 

AGTTCCATGCCCAACCAGAGATCTCTGAGAGAGACCGAACTCTGCAGGTGGAACTGGAAGGGGCCCAGG 

TGTTACGCAGTCGACTAGAAGAAGTTCTTGGAAGAAGCCTGGAGCGCTTAAGCAGGCTGGAGACCCTGG 

CCGCCATTGGAGG7GCTACTGCAGGCGATGAGACTGAAGATACAAGCACACAGTTCACAGACAGCATTG 

AGGAGGAGGCTGCACACAACAGCCACCAGCAACTCATCAAGGTGTCTTTGGAGAAAAGCCTGACCACCA 

TGGAGACCCAGAACACATGTCTTCAGCCCCCTTCCCCAGTAGGAGAGGATGGTAACAGGCATCTTCAGG 

AAGAAATGCTCCACCTGAGGGCTGAAATCCACCAGCCCTTAGAAGAGAAGAGAAAAGCTGAGGCAGAAC 

TCAAGGAGCTAAAGGCTCAAATTGAGGAAGCAGGATTCTCCTCTGTGTCCCACATCAGGAACACCATGC 

TGAGCCTTTGCCTTTGCCTTGAGAATGCAGAGCTGAAAGAGCAGATGGGAGAAGCAATGTCTGATGGAT 

GGGAGGTGGAGGAAGACAAGGAGAAGGGCGAGGTGATGGTGGAGACCGTGGTGGCCAAAGGGGGTCTGA 

G TGAGG ACAGCC?TCAGGCTGAGTTCAGG> AAGT CCAGGGGAGACT CAAGAGTGCCTACAACATCATCA 

ACCTCCTCAAAGAGCAGCTGGTCCTGAGAAGCTCGGAAGGGAACACTAAGGAGATGCCAGAGTTCCTCG 

TGCGCCTGGCCAGGGAGGTGGACAGAATGAACATGGGCTTGCCTTCCTCGGAGAAGCATCAACACCAAG 

AACAGGAGAATATGACCGCAAGGCCTGGCCCCAGGCCCCAGAGTCTCAAGCTTGGGACAGCTCTCTCAG 

TAGACGGCTACCAACTGGAGAACAAGTCCCAGGCCCAAGACTCTGGACATCAGCCAGAATTTAGCCTAC 

CAGGGTCCACCAAACACCTGCGCTCCCAGCTGGCTCAGTGTAGACAACGGTACCAAGATCTCCAGGAGA 

AGCTGCTCATCTCAGAAGCCACTGTGTTTGCCCAGGCAAACCAGCTAGAGAAGTACAGAGCCATATTAA 

GTGAATCCCTGGTGAAGCAGGACAGCAAGCAGATCCAGGTGGACCTTCAGGACCTGGGCTATGAGACTT 

GTGGCCGAAGTGAGAATGAAGCTGAACGTGAGGAGACCACCAGCCCTGAGTGTGAGGAGCACGGTAACC 

TGAAGCCTGTGGTGCTGGTGGAAGGCTTGTGCTCTGAGCAAGGGTACCTGGACCCTGTCTTGGTCAGCT 

CACCTGTGAAGAACCCTTGGAGAACAAGCCAGGAAGCCAGAAGAATCCAGGCACAAGGAACTTCAGACA 

ACAGCTCTCTCCTGAGGAAGGACATCCGAAATCTGAAAGCCCAGCTACCGAATGCCTACAAGGTCCTTC 

AGAACCTGAGGAGCCGGGTCCGGTCCCTGTCTGCCACAAGCGATTACTCATCGAGTCTGGAGAGACCCC 

GCAAGCTGATAGCCGTGGCAACCCTTGAGGGGGCCTCACCCCACAGTGTCACTGATGAAGACGAAGGCT 

TGTTGTCAGATGGCACCGGGGCTTTTTACCCTCCAGGGCTCCAGGCCAAAAAGAATCTAGAGAATCTCA 

TCCAGAGAGTATCCCAGCTGGAGGCCC^GCTCCCCJUUUVCTGGACTAGAAGGGAAGCTGGCTGAAGAAC 

TGAAGTCCGCCTCGTGGCCTGGAAAATACGATTCTTTGATTCAGGATCAGGCCCGAAAAACTGTCATAT 

CTGCGTCCGAAAATACXAAAAGGGAGAAGGATTTGTTTTCTTCTCACCCAACATTCGAAAGATACGTCA 

AATCTTTTGAAGACCTCCTGAGGAACAACGACTTGACTACTTACCTGGGCCAGAGCTTCCGGGAACAAC 

TTAGTTCAAGGCGTTCAGTGACAGACAGGCTGACCAGCAAATTCAGCACAAAGGATCATAAGAGTGAAA 

AAGAAGAAGTTGGGCTTGAGCCACTGGCCTTCAGGTTCAGCAGGGAATTACAGGAGAAAGAGAAAGTGA 

TTGAAGTCCTGCAGGCCAAGGTGGATACCCGGTTTTTCTCACCCCCCAGCAGCCATGCTGCGTCTGAGT 

CCCACCGTTGTGCCAGCAGCACATCTTTCCTGTCGGATGACATAGAAGCCTGCTCTGACATGGACGTAG 

CCAGCGAGTACACACACTATGAAGAGAAGAAGCCCTCACCCAGTAACTCAGCAGCCAGTGCATCTCAGG 

GGCTTAAGGGCGAGCCCAGAAGCAGCTCCATCAGCTTGCCAACTCCCCAGAACCCCCCTAAGGAGGCCA 

GCCAGGCCCAGCCAGGCTTTCACTTTAACTCCATACCCAAGCCGGCTAGCCTTTCCCAGGCACCAATGC 

ACTTCACTGTACCCAGCTTCATGCCTTTCGGCCCCTCTGGGCCTCCCCTTCTTGGTTGCTGTGAGACAC 

CAGTGGTGTCCTTGGCTGAGGCTCAACAAGAGCTGCAGATGCTGCAGAAGCAGCTGGGACGAAGTGTTA 

GCATTGCCCCTCCCACCTCCACATCCACGTTGCTTAGCAACCACACAGAAGCTAGCTCTCCCCGCTACA 

GCJUiCCCTGCTCAGCCCCACTCCCCAGCAAGGGGCACCATAGAGCTGGGCAGAATCCTGGAGCCTGGAT 

ACCTGGGCAGCGGCCAGTGGGACATGATGAGGCCTCAGAAAGGGAGCATCTCTGGGGAGCTGTCCTCAG 

GCTCCTCGATGTACCAGCTTAACTCCAAACCCACAGGGGCCGACCTGTTGGAAGAGCATTTAGGTGAGA 

TCCGGAACCTGCGCCAGCGCCTGGAGGAGTCCATATGTGTCAATGACAGGCTACGGGAGCAGCTGCAGC 

ATAGGCTCAGCTCCACGGCCCGAGAAAATGGTTCCACCTCTCACTTCTACAGTCAGGGCCTGGAGTCCA 

TGCCTCAGCTCTACAATGAGAACAGAGCCCTCAGGGAAGAAAACCAAAGCCTGCAGACACGGCTCAGTC 

ATGCTTCCAGGGGACACTCCCAGGAAGTGGACCACCTGAGGGAGGCTCTGCTTTCCTCAAGTTCCCAGC 

TCCAGGAGCTGGAGAAGGAGCTGGAGCAGCAGAAGGCTGAGCGGCGGCAGCTTCTGGAAGACTTGCAGG 

AGAAGCAGGATGAGATCGTGCATTTCCGAGAGGAGAGGCTGTCCCTCCAGGAAAACAACTCCAGGCTGC 

AGCACAAGCTGGCCCTCCTGCAACAACAGTGTGAGGAGAAACAGCAGCTCTCCCTGTCCCTGCAGTCAG 

AGCTCCAGATCTACGAGTCCCTCTACGAAAATCCTAAGAAGGGCTTGAAAGCCTTCAGCCTAGATTCCT 

GTTACCAAGTCCCGGGTGAGTTGAGCTGCCTG*GTGGCAGAGATTCGAGCTCTGAGAGTGCAGTTGGAGC 

AG AGCATTCAAGTGAACAACCGT CTGCGGCTGCAG CTGGAACAGCAGATGGATCACGGTGCTGGCAAAG 

CCAGTCTCAGTTCCTGCCCTGTTAACCAGAGCTTCTCAGCCAAGGCGGAGCTGGCAAACCAGCAGCCAC 

CCTTCCAAGGTTCAGCTGCTTCCCCTCCAGTCCGGGACGTTGGCTTGAATTCTCCACCCGTGGTCCTCC 

CCAGCAATTCGTGCTCTGTTCCTGGCTCAGACTCTGCCATCATCAGTAGGACAAACAATGGTTCGGATG 

AGTCTGCAGCAACGAAGACCCCTCCCAAGATGGAGGTCGATGCTGCTGATGGCCCATTTGCCAGTGGAC 

ACGGCA3ACACGTCATCGGCCATGTGGATGACTACGACGCCCTACAGCAGCAGATTGGGGAAGGGAAGC 

TGCTGATCCAAAAGATACTGTCTCTCACGAGGCCAGCACGCAGCG7CCCTGCACTGGACGCGCAGGGCA 
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CAGAGGCACCAGGTACCAAAAGTGTCCATGAGCTTCGGAGCAGCGCCAGGGCTCTGAACCACAGCCTAG 
AAGAGTCAGCTTCCCTCCTCACCATGTTCTGGAGAGCAGCTTTGCCAAACTCTCATGGTTCTGTACTGG 
TAGGCGAAGAGGGAAACCTGATGGAGAAAGAACTCCTAGACCTGCGAGCCCAAGTGTCCCAACAGCAAC 
AGCTCCTTCAGAGCACTGCTGTGCGTCTGAAGACGGCCAACCAGAGGAAGAAAAGCATGGAGCAGTTCA 
TCGTGAGCCATCtGACCAGGACCCATGATGTCTTGAAGAAAGCACGGACTAATTTAGAGATGAAATCCT 
TCAGGGCCCTGATGTGCACTCCAGCCTTGTGACCCTTGCCTTCCAGGAGCCACATAAAAGGCGAAGCCA 
GGAGTCCTTAAAACAGCAGGAAGGGTGGGCCTGCCCGCCCCTAGTACAGCTGCCTGTCTGCTGAGGAAT 
ACCTGGTCCGACTCCTCCCCTGCTGGAGCTCCAGGGAAGGGCTCATATATGTGTCCACATGGGACAGGC 
AGGAAGGAAAGTGGCATCCTGACAATGAATATGATTAGCCAAGGCCCACTGGGCCCATCACTAAGCAAA 
ACTCATGTAGACTGTGTAGAAGGCCCCCCGGCACTGCTTCTAGACAGCCTCAGCAGCACGGTGCCCACC 
TCGTTAC AGTT CTCACCTCAAGAT AGCCAACTCAGGGGAACTAGGACCT TACCACfXACAAAC AGGATG 
TGTGGTCCCAATGCCAACGCTCCTCAGACAGTTGTAAAAGCACACATCATTGAGTGGCAGCGTCCAGCC 
GGACACTGTTGGAGACTACCAAACCCCTCACTGACCCAGTCTTGGGCCAGGCCAGCTCTGTGGGCCAAG 
TCTGGTAGTACTTTGGTCTCTACCACCACACCAGAGAGAGTCTATATAGCAAATGTGGTAACTTGTAGG 
TGCCCTGCACTTAGCCTAGCACCTTCTGTTTCTTACGTGATCTCAAGTTGAACCAACTTCCTTAACTCT 
GCTGTCCCCTGAATCCTAACTTCCCTCAGGGGAATTGGAGATTGGTGGCCACATCATGCCTATTGAATG 
TTTAGTGAACAGCATATCGGTGCCTCTTAATGGCATGGGCAAGGCCTGCTCTGTACTGAAGACTGTGTC 
TTCACAGTGCTCATAGGACGTGGGTGTGTGTATAAATGTATAATATAGATTTATATATGTCGCTATGGC 
TATGTGTTGAAGGCCAGCATAAGTGCAGAGCGATGGGTGAGAAGACGCTAAGCAGTCTTTCTTATGGCT 
ATTAAAGCTAACTGTGTAC (SEQ ID NO: 02) 
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MYOMEGALIN firstMET until stop 

ATGTC7MTGGATATCGCACTCTGTCCCAGCACCTCAATGACCTGAAGAAGGAGAACTTCAGCCTCAAG 
CTGCGCATCTACTTCCTGGAGGAGCGCATGCAACAGMGTATGAAGTCAGCCGGGAGGACGTCTACAAG 
CGGAACATTGAGCTGAAGGTTGAAGTGGAGAGCCTGAAACGAGAGCTCCAGGACAGGAAACAGCATCTA 
CATAAAACATGGGCCGATGAGGAGGATCTCAACAGCCAGAATGAAGCAGAGCTCCGGCGCCAGGTTGAA 
GAACCGCAGCAGGAGACAGAACACGTTTATGAGCTCCTAGACAACAACATTCAGCTGCTGCAGGAGGAA 
TCCAGGTTTGCAAAGGATGAAGCCACACAGATGGAGACTCTGGTGGAGGCAGAGAAGGGGTGTAATCTG 
GAGCTCTCAGAGAGGTGGAAGGATGC TACCAAGAACAGGGAAG ATGCACCGGGAGACC AGG TGAAGCTT 
GACCAATATTCTGCGGCACTGGC TCAGAGGGACAGGAGAATTG AAGAGC TGAGGCAGAGCT TG GCTGCC 
CAGGAGGGGCTTGTGGAACAGCTGTCTCGAGAGAAACAACAACTGTTACATCTGCTGGAGGAGCCTGGG 
GGCATGGAAGTGCAGCCCATGCCTAAAGGGTTACCCACGCAACAAAAGCCAGACCTAAATGAGACCCCT 
ACAACCCAGCCATCTGTGTCTGATTCCCACCTGGCAGAACTCCAGGACAAAATCCAGCAAACAGAGGTC 
ACCAAC AAGAT7C 7TCAAGAGAAACTGAATGACAT GAGCTG TGAGCTCAGATCTGCACAGGAG TCGTCT 
CAGAAGCAAGATACGACAATCCAAAGCCT CAAGG AAATGCT AAAGAGCAGGGAAAGTG AGAC TGAAGAG 
CTGTACCAGGTGATTGAAGGTCAAAATGACACAATGGCAAAGCTTCCGGAAATGCTACACCAGAGCCAG 
CTCGGACAGCTCCAGAGCTCAGAGGGCATTGCCCCTGCTCAGCAGCAAGTGGCCCTGCTTGACCTTCAG 
AGTGCTCTGTTCTGCAGCCAGCTTGAAATCCAGAAGCTCCAGAGGCTGTTACGCCAGAAAGAGCGTCAG 
"CTGGCTGACGGCAAGCGGTGCATGCAATTTGTGGAGGCTGCAGCACAGGAGAGAGAGCAGCAGAAGGAA 
GCTGCT TGG AAACATAACCAGGAATTACGAAAAGCTT TGCAACACCTCCAAGGAGAACTGCACAGTAAG 
AGCCAACAGCTCCACGTTCTGGAGGCAGAAAAATATAATGAAATTCGAACCCAGGGACAAAACATTCAA 
CACCTAAGTCACAGTCTGAGTCACAAAGAGCAGCTAATTCAGGAACTTCAGGAGCTCCTACAGTATCGG 
GATACCACAGACAAAACTCTAGACACAAATGAGGTGTTTCTTGAGAAACTACGGCAACGAATACAAGAC 
CGGGCAGTTGCTCTAGAGCGGGTTATAGATGAAAAGTTCTCTGCTCTAGAAGAAAAGGACAAGGAACTG 
CGGCAGCTCCGGCTTGCTGTGAGGGACCGAGACCATGACTTAGAGAGACTGCGTTGTGTCCTGTCTGCC 
AATGAAGCTACCATGCAAAGTATGGAGAGTCTCCTGAGGGCCAGAGGCCTGGAAGTGGAGCAGTTAATT 
GCCACCTGCCAAAACCTCCAGTGGTTGAAGGAAGAATTGGAAACCAAGTTTGGCCACTGGCAGAAGGAA 
CAGGAGAGCATCATTCAGCAGTTACAGACATCTCTGCATGACyVGGAACAAA^ 

GCAACTTTGCTCCACAAACTTGGACCCGGCCAGAGTGAAGTAGCTGAGGAGCTGTGCCAGCGCCTGCAG 

CGGAAGGAAAGGGTGCTGCAGGACCTTCTGAGTGATCGGAACAAACAAGCCATGGAGCACGAGATGGAG 

GTCCAGGGACTGCTCCAGTCGATGGGCACCCGGGAACAGGAAAGACAGGCTGTTGCAGAAAAAATGGTA 

CAAGCCTTCATGGAAAGAAACTCGGAATTACAGGCCCTGCGGCAGTATCTAGGGGGGAAGGAATTAATG 

GCAGCATCTCAGGCATTCATCTCTAACCAACCAGCTGGAGCGACTTCTGTAGGCCCCCACCATGGAGAG 

CAAACTGACCAAGGTTCTACGCAGATGCCCTCTCGAGACGACAGCACCTCGCTGACTGCCAGAGAGGAG 

GCCAGCATACCCCGGTCTACATTAGGAGACTCAGACACAGTTGCAGGGCTGGAGAAAGAACTGAGCAAT 

G CCAAGGAGGAGCT TGAGCTCATGGCCAAAAAAGAAAGAGAAAGCCAGATAGAAT TGTCTGCCCTGCAG 

TCCATGATGGCTGTGCAAGAGGAAGAGCTGCAGGTGCAGGCTGCTGACTTGGAGTCCCTGACCAGGAAC 

ATACAGATAAAAGAAGACCTCATAAAGGACCTGCAAATGCAACTGGTTGACCCTGAAGATATGCCAGCC 

ATGGAGCGCCTGACCCAAGAGGTCTTACTTCTTCGGGAAAAAGTTGCTTCAGTGGAACCCCAGGGTCAG 

GAAGGGTOVGAGAACAGGAGACAACAGTTGCTGCTGATGTTAGAAGGACTAGTGGATGAACGGAGTCGG 

CTCAACGAGGCCCTGCAAGCTGAGCGGCAGCTCTACAGCAGCCTGGTCAAGTTCCATGCCCAACCAGAG 

ATCTCTGAGAGAGACCGAACTCTGCAGGTGGAACTGGAAGGGGCCCAGGTGTTACGCAGTCGACTAGAA 

GAAGTTCTTGGAAGAAGCCTGGAGCGCTTAAGCAGGCTGGAGACCCTGGCCGCCATTGGAGGTGCTACT 

GCAGGCGATGAGACT GAAGATACAAGCACACAGT T CACAGACAG CATT GAGGAGGAGGCT GCACACAAC 

AGCCACCAGCAACTCATCAAGGTGTCTTTGGAGAAAAGCCTGACCACCATGGAGACCCAGAACACATGT 

CTTCAGCCCCCTTCCCCAGTAGGAGAGGATGGTAACAGGCATCTTCAGGAAGAAATGCTCCACCTGAGG 

GCTGAAATCCACCAGCCCTTAGAAGAGAAGAGAAAAGCTGAGGCAGAACTCAAGGAGCTAAAGGCTCAA 

ATTGAGGAAGCAGGAT TCTCCTCT GTGTCCCACATCAGGAACACCATGCTGAGCCTTTGCCTTTGCCTT 

GAGAATGCAGAGCTGAAAGAGC^VGATGGGAGAAGCAATGTCTGATGGATGGGAGGTGGAGGAAGACAAG 

GAGAAGGGCGAGGTGATGGTGGAGACCGTGGTGGCCAAAGGGGGTCTGAGTGAGGACAGCCTTCAGGCT 

GAGTTCAGGAAAGTCCAGGGGAGACTCAAGAGTGCCTACAACATCATCAACCTCCTCAAAGAGCAGCTG 

GTCCTGAGAAGCTCGGAAGGGAACACTAAGGAGATGCCAGAGTTCCTCGTGCGCCTGGCCAGGGAGGTG 

GACAGAATGAACATGGGCTTGCCTTCCTCGGAGAAGCATCAACACCAAGAACAGGAGAATATGACCGCA 

AGGCCTGGCCCCAGGCCCCAGAGTCTCAAGCTTGGGACAGCTCTCTCAGTAGACGGCTACCAACTGGAG 

AACAAGTCCCAGGCCCAAGACTCTGGACATCAGCCAGAATTTAGCCTACCAGGGTCCACCAAACACCTG 

CGCTCCCAGCTGGCTCAGTGTAGACAACGGTACCAAGATCTCCAGGAGAAGCTGCTCATCTCAGAAGCC 

ACTGTGTTTGCCCAGGCAAACCAGCTAGAGAAGTACAGAGCCATATTAAGTGAATCCCTGGTGAAGCAG 

GACAGCAAGCAGATCCAGGTGGACCTTCAGGACCTGGGCTATGAGACTTGTGGCCGAAGTGAGAATGAA 

GCTGAACGTGAGGAGACCACCAGCCCTGAGTGTGAGGAGCACGGTAACCTGAAGCCTGTGGTGCTGGTG 

GAAGGCTTGTGCTCTGAGCAAGGGTACCTGGACCCTGTCTTGGTCAGCTCACCTGTGAAGAACCCTT GG 

AGAACAAGCCAGGAAGCCAGAAGAATCCAGGCACAAGGAACTTCAGACAACAGCTCTCTCCT GAGGAAG 

GACATCCGAAATCTCAAAGCCCAGCTACCGAATGCCTACAAGGTCCTTCAGAACCTGA-GAGCGGGTC 
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CGGTCCCTGTCTGCCACAAGCGATTACTCATCGAGTCTGGAGAGACCCCGCAAGCTGATAGCCGTGGCA 

ACCCTTGAGGGGGCCTCACCCCACAGTGTCACTGATGAAGACGAAGGCTTGTTGTCAGATGGCACCGGG 

GCTTTTTACCCTCCAGGGCTCCAGGCCAAAAAGAATCTAGAGAATCTCATCCAGAGAGTATCCCAGCTG 

GAGGCCCAGCTCCCCAAAACTGGACTAGAAGGGAAGCTGGCTGAAGAACTGAAGTCCGCCTCGTGGCCT 

GGAAAATACGATTCTTTGATTCAGGATCAGGCCCGAAAAACTGTCATATCTGCGTCCGAAAATACXAAA 

AGGGAGAAGGATTTGTTTTCTTCTCACCCAACATTCGAAAGATACGTCAAATCTTTTGAAGACCTCCTG 

AGGAACAACGACTTGACTACTTACCTGGGCCAGAGCTTCCGGGAACAACTTAGTTCAAGGCGTTCAGTG 

ACAGACAGGCTGACCAGCAAATTCAGCACAAAGGATCATAAGAGTGAAAAAGAAGAAGTTGGGCTTGAG 

CCACTGGCCTTCAGGTTCAGCAGGGAATTACAGGAGAAAGAGAAAGTGATTGAAGTCCTGCAGGCCAAG 

GTGGATACCCGGTTTTTCTCACCCCCCAGCAGCCATGCTGCGTCTGAGTCCCACCGTTGTGCCAGCAGC 

ACATCTTTCCTGTCGGATGACATAGAAGCCTGCTCTGACATGGACGTAGCCAGCGAGTACACACACTAT 

GAAGAGAAGAAGCCCTCACCCAGTAACTCAGCAGCCAGTGCATCTCAGGGGCTTAAGGGCGAGCCCAGA 

AGCAGCTCCATCAGCTTGCCAACTCCCCAGAACCCCCCTAAGGAGGCCAGCCAGGCCCAGCCAGGCTTT 

CACTTTAACTCCATACCCAAGCCGGCTAGCCTTTCCCAGGCACCAATGCACTTCACTGTACCCAGCTTC 

ATGCCTTTCGGCCCCTCTGGGCCTCCCCTTCTTGGTTGCTGTGAGACACCAGTGGTGTCCTTGGCTGAG 

GCTCAACAAGAGCTGCAGATGCTGCAGAAGCAGCTGGGACGAAGTGTTAGCATTGCCCCTCCCACCTCC 

ACATCCACGTTGCTTAGCAACCACACAGAAGCTAGCTCTCCCCGCTACAGCAACCCTGCTCAGCCCCAC 

TCCCCAGCAAGGGGCACCATAGAGCTGGGCAGAATCCTGGAGCCTGGATACCTGGGCAGCGGCCAGTGG 

GACATGATGAGGCCTCAGAAAGGGAGCATCTCTGGGGAGCTGTCCTCAGGCTCCTCGATGTACCAGCTT 

AACTCCAAACCCACAGGGGCCGACCTGTTGGAAGAGCATTTAGGTGAGATCCGGAACCTGCGCCAGCGC 

CTGGAGGAGTCCATATGTGTCAATGACAGGCTACGGGAGCAGCTGCAGCATAGGCTCAGCTCCACGGCC 

CGAGAAAATGGTTCCACCTCTCACTTCTACAGTCAGGGCCTGGAGTCCATGCCTCAGCTCTACAATGAG 

AACAGAGCCCTCAGGGAAGAAAACCAAAGCCTGCAGACACGGCTCAGTCATGCTTCCAGGGGACACTCC 

CAGGAAGTGGACCACCTGAGGGAGGCTCTGCTTTCCTCAAGTTCCCAGCTCCAGGAGCTGGAGAAGGAG 

CTGGAGCAGCAGAAGGCTGAGCGGCGGCAGCTTCTGGAAGACTTGCAGGAGAAGCAGGATGAGATCGTG 

CATTTCCGAGAGGAGAGGCTGTCCCTCCAGGAAAACAACTCCAGGCTGCAGCACAAGCTGGCCCTCCTG 

CAACAACAGTGTGAGGAGAAACAGCAGCTCTCCCTGTCCCTGCAGTCAGAGCTCCAGATCTACGAGTCC 

CTCTACGAAAATCCTAAGAAGGGCTTGAAAGCCTTCAGCCTAGATTCCTGTTACCAAGTCCCGGGTGAG 

TTGAGCTGCCTGGTGGCAGAGATTCGAGCTCTGAGAGTGCAGTTGGAGCAGAGCATTCAAGTGAACAAC 

CGTCTGCGGCTGCAGCTGGAACAGCAGATGGATCACGGTGCTGGCAAAGCCAGTCTCAGTTCCTGCCCT 

GTTAACCAGAGCTTCTCAGCCAAGGCGGAGCTGGCAAACCAGCAGCCACCCTTCCAAGGTTCAGCTGCT 

TCCCCTCCAGTCCGGGACGTTGGCTTGAATTCTCCACCCGTGGTCCTCCCCAGCAATTCGTGCTCTGTT 

CCTGGCTCAGACTCTGCCATCATCAGTAGGACAAACAATGGTTCGGATGAGTCTGCAGCAACGAAGACC 

CCTCCCAAGATGGAGGTCGATGCTGCTGATGGCCCATTTGCCAGTGGACACGGCAGACACGTCATCGGC 

CATGTGGATGACTACGACGCCCTACAGCAGCAGATTGGGGAAGGGAAGCTGCTGATCCAAAAGATACTG 

TCTCTCACGAGGCCAGCACGCAGCGTCCCTGCACTGGACGCGCAGGGCACAGAGGCACCAGGTACCAAA 

AGTGTCCATGAGCTTCGGAGCAGCGCCAGGGCTCTGAACCACAGCCTAGAAGAGTCAGCTTCCCTCCTC 

ACCATGTTCTGGAGAGCAGCTTTGCCAAACTCTCATGGTTCTGTACTGGTAGGCGAAGAGGGAAACCTG 

ATGGAGAAAGAACTCCTAGACCTGCGAGCCCAAGTGTCCCAACAGCAACAGCTCCTTCAGAGCACTGCT 

GTGCGTCTGAAGACGGCCAACCAGAGGAAGAAAAGCATGGAGCAGTTCATCGTGAGCCATCTGACCAGG 

ACCCATGATGTCTTGAAGAAAGCACGGACTAATTTAGAGATGAAArCCTTCAGGGCCCTGATGTGCACT 

CCAGCCTTGTGA (SEQ ID NO: 03) 
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FIG. 4 
Human rayamagalin cOKA 

1 GGATCCTTGA GGGCACTGGT GCGACTTTCA GGTGAGGTCT TAGCAGATGA 
51 AAGCGGCTGG CTGTGGCCCG CGCCAGTAGT GCTTTCTGCT CCGCACTCGC 
101 CGTGAGCCAG GTGTGCAACC GGATTTGGGG CGAGGGTCGC GCTGGCTACC 
151 TCGCATGCGC AGAGCCGGAA GCCCGCTGAC CGGACTACAG CTCCCAGAAG 
201 AGCCTTGTGG AGGCCGCAGA CGCGAAGCCG CTGGCGCCAT CTTGAAATCT 
251 GATCCTCCAT CCCCGAGGCT TTGCGTCTGC GCGGCCGGCC GCTGCTGCTC 
301 CGGGAGCCCA GTCTGCTAAA AGGGGAGGAC GTTGAGGACG CGGCGGCTGG 
351 CGGGAGAGAC AGCTGGGGAG AGACATGGCA GGGTCGGAGC GCGGCCTGCG 
4 01 CCTCTGTCAC TCAGCATCCT CTTAGGCGTT TCCACGCCCG CCCCCTGCCC 
4 51 GAGGGGCGGG GCTGACGGCT CTGGTACCCG GAGTCGGCGC GCGGGGCAGG 
501 GGCGCGCCCC TGCAGAGTGG GGACCCCACT GGGCTGTGCC ATGCTGACCG 
551 GAGACCACCG AGGCGGGAGA CAGAGCGCGG CGAAGAGCCA TTGAGTGGTC 
601 ACCCAGTAGC CGCCGCCGCC GCCGCCTCGG GAAGCTTGCC ACCCGCTAGG 
€51 AGGGAAGATG AAGGAGATTT GCAGGATCTG TGCCCGAGAG CTGTGTGGAA 
701 ACCAGCGGCG CTGGATCTTC CACACGGCGT CCAAGCTCAA TCTCCAGGTT 
751 CTGCTTTCGC ACGTCTTGGG CAAGGATGTC CCCCGCGATG GCAAAGCCGA 
801 GTTCGCTTGC AGCAAGTGTG CTTTCATGCT TGATCGAATC TATCGATTCG 
851 ACACAGTTAT TGCCCGGATT GAAGCGCTTT CTATTGAGCG CTTGCAAAAG 
901 CTGCTACTGG AGAAGGATCG CCTCAAGTTC TGCATTGCCA GTATGTATCG 
951 GAAGAATAAC GATGACTCTG GCGCGGAGAT CAAGGCGGGG AATGGGACGG 
1001 TTGACATGTC CGTCTTACCC GATGCGAGAT ACTCTGCACT GCTCCAGGAG 
1051 GACTTCGCCT ATTCAGGGTT TGAGTGCTGG GTGGAGAATG AGGATCAGAT 
1101 CCAGGAGCCA CACAGCTGCC ATGGTTCAGA AGGCCCTGGA AACCGACCCA 
1151 GGAGATGCCG TGGTTGTGCC GCTTTGCGGG TTGCTGATTC TGACTATGAA 
1201 GCCATTTGTA AGGTACCTCG AAAGGTGGCC AGAAGTATCT CCTGCGGCCC 
1251 TTCTAGCAGG TGGTCGACCA GCATTTGCAC TGAAGAACCA GCGTTGTCTG 
1301 AGGTTGGGCC ACCCGACTTA GCAAGCACAA AGGTACCCCC AGATGGAGAA 
1351 AGCATGGAGG AAGAGACGCC TGGTTCCTCT GTGGAATCTT TGGATGCAAG 
1401 CGTCCAGGCT AGCCCTCCAC AACAGAAAGA TGAGGAGACT GAGAGAAGTG 
1451 CAAAGGAACT TGGAAAGTGT GACTGTTGTT CAGATGATCA GGCTCCGCAG 
1501 CATGGGTGTA ATCACAAGCT GGAATTAGCT CTTAGCATGA TTAAAGGTCT 
1551 TGATTATAAG CCCATCCAGA GCCCCCGAGG GAGCAGGCTT CCGATTCCAG 
1601 TGAAATCCAG CCTACCTGGA GCCAAGCCTG GCCCTAGCAT GACAGATGGA 
1651 GTTAGTTCCG GTTTCCTTAA CAGGTCTTTG AAACCCCTTT ACAAGACACC 
1701 TGTGAGTTAT CCCTTGGAGC TTTCAGACCT GCAGGAGCTG TGGGATGATC 
1751 TCTGTGAAGA TTATTTGCCG CTCCGGGTCC AGCCCATGAC TGAAGAGTTG 
1801 CTGAAACAAC AAAAGCTGAA TTCACATGAG ACCACTATAA CTCAGCAGTC 
1851 TGTATCTGAT TCCCACTTGG CAGAACTCCA GGAAAAAATC CAGCAAACAG 
1901 AGGCCACCAA CAAGATTCTT CAAGAGAAAC TTAAXGAAAT GAGCTATGAA 
1951 CTAAAGTGTG CTCAGGAGTC GTCTCAAAAG CAAGATGGTA CAATTCAGAA 
2001 CCTCAAGGAA ACTCTGAAAA GCAGGGAACG TGAGACTGAG GAGTTGTACC 
2051 AGGTAATTGA AGGTCAAAAT GACACAATGG CAAAGCTTCG AGAAATGCTG 
2101 CACCAAAGCC AGCTTGGACA ACTTCACAGC TCAGAGGGTA CTTCTCCAGC 
2151 TCAGCAACAG GTAGCTCTGC TTGATCTTCA GAGTGCTTTA TTCTGCAGCC 
2201 AACTTGAAAT ACAGAAGCTC CAGAGGGTGG TACGACAGAA AGAGCGCCAA 
2251 CTGGCTGATG CCAAACAATG TGTGCAATTT GTAGAGGCTG CAGCACACGA 
2301 GAGTGAACAG CAGAAAGAGG CTTCTTGGAA ACATAACCAG GAATTGCGAA 
2351 AAGCCTTGCA GCAGCTACAA GAAGAATTGC AGAATAAGAG CCAACAGCTT 
2401 CGTGCCTGGG AGGCTGAAAA ATACAATGAG ATTCGAACCC AGGAACAAAA 
2451 CATCCAGCAC CTAAACCATA GTCTGAGTCA CAAGGAGCAG TTGCTTCAGG 
2501 AATTTCGGGA GCTCCTACAG TATCGAGATA ACTCAGACAA AACCCTTGAA 
2551 GCAAATGAAA TGTTGCTTGA GAAACTTCGC CAGCGAATAC ATGATAAAGC 
2601 TGTTGCTCTG GAGCGGGCTA TAGATGAAAA ATTCTCTGCT CTAGAAGAGA 
2651 AAGAAAAAGA ACTGCGCCAG CTTCGTCTTG CTGTGAGAGA GCGAGATCAT 
2701 GACTTAGAGA GACTGCGCGA TGTCCTCTCC TCCAATGAAG CTACTATGCA 
27 51 AAGTA7GGAG AGTCTCCTGA GGGCCAAAGG CCTGGAAGTG GAACACTTAT 
2801 CTACTACCTG TCAAAACCTC CAGTGGCTGA AAGAAGAAAT GGAAACCAAA 
2851 TTTAGCCGTT GGCAGAAGGA ACAAGAGAGT ATCATTCAGC AGTTACAGAC 
2901 GTCTCTTCAT GATAGGAACA AAGAAGTGGA GGATCTTAGT CCAACACTGC 
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2951 TCTGCAAACT TGGACCAGGG CAGAGTGAGA TAGCAGAGGA GCTGTGCCAG 
3001 CGTCTACAGC GAAAGGAAAG GATGCTGCAG GACCTTCTAA GTGATCGAAA 
3051 TAAACAAGTG CTGGAACATG AAATGGAGAT TCAAGGCCTG CTTCAGTCTG 
3101 TGAGCACCAG GGAGCAGGAA AGCCAAGCTG CTGCAGAGAA GTTGGTGCAA 
3151 GCCTTAATGG AAAGAAATTC AGAATTACAG GCCCTGCGCC AATATTTAGG 
3201 AGGGAGAGAC TCCCTGATGT CCCAAGCACC CATCTCTAAC CAACAAGCTG 
3251 AAGTTACCCC CACTGGCCGT CTTGGAAAAC AGACTGATCA AGGTTCAATG 
3301 CAGATACCTT CCAGAGATGA TAGCACTTCA TTGACTGCCA AAGAGGATGT 
3351 CAGCATACCC AGATCCACAT TAGGAGACTT GGACACAGTT GCAGGGCTGG 
3401 AAAAAGAACT GAGTAATGCC AAAGAGGAAC TTGAACTCAT GGCTAAAAAA 
3451 GAAAGAGAAA GTCAGATGGA ACTTTCTGCT CTACAGTCCA TGATGGCTGT 
3501 GCAGGAAGAA GAGCTGCAGG TGCAGGCTGC TGATATGGAG TCTCTGACCA 
3551 GGAACATACA GATTAAAGAA GATCTCATAA AGGACCTGCA AATGCAACTG 
3601 GTTGATCCTG AAGACATACC AGCTATGGAA CGCCTGACCC AGGAAGTCTT 
3651 ACTTCTTCGG GAAAAAGTTG CTTCAGTAGA ATCCCAGGGT CAAGAAATTT 
3701 CAGGAAACCG AAGACAACAG TTGCTGCTGA TGCTAGAAGG ACTAGTAGAT 
3751 GAACGGAGTC GGCTCAATGA GGCCTTACAA GCAGAGAGAC AGCTCTATAG 
3801 CAGTCTGGTG AAGTTCCATG CCCATCCAGA GAGCTCTGAG AGAGACCGAA 
3851 CTCTGCAGGT GGAACTGGAA GGGGCTCAGG TGTTACGCAG TCGGCTAGAA 
3901 GAAGTTCTTG GAAGAAGCTT GGAGCGCTTA AACAGGCTGG AGACCCTGGC 
3951 CGCCATTGGA GGTGCAGCTG CAGGGGATGA CACCGAAGAT ACAAGCACTG 
4001 AGTTCACTGA CAGTATTGAG GAGGAGGCTG CACACCATAG TCACCAGCAA 
4051 CTTGTCAAGG TGGCTTTGGA GAAAAGTCTG GCAACTGTGG AGACCCAGAA 
4101 CCCATCTTTT TCCCCTCCTT CTCCGATGGG AGGGGACAGT AACAGGTGTC 
4151 TTCAGGAAGA AATGCTCCAC CTGAGGGCTG AGTTCCACCA GCACTTAGAA 
4201 GAGAAGAGGA AAGCTGAGGA GGAACTGAAG GAGCTAAAGG CTCAAATTGA 
4251 GGAAGCAGGA TTCTCCTCAG TGTCCCACAT CAGGAACACC ATGCTGAGCC 
4301 TTTGCCTTGA GAATGCGGAG CTGAAAGAGC AGATGGGAGA AGCAATGTCT 
4351 GATGGATGGG AGATCGAGGA AGACAAGGAG AAGGGCGAGG TGATGGTTGA 
4401 GACTGTGGTA ACCAAAGAGG GTCTGAGTGA GAGTAGCCTT CAGGCTGAGT 
4451 TCAGAAAGCT CCAGGGAAAA CTGAAGAATG CCCACAATAT CATCAACCTC 
4501 CTCAAAGAAC AACTTGTGCT GAGTAGCAAG GAAGGGAATA GTAAACTTAC 
4551 TCCAGAGCTC CTTGTGCATC TGACCAGCAC CATTGAAAGA ATAAACACAG 
4 601 AACTGGTTGG TTCCCCTGGG AAGCACCAAC ACCAAGAGGA GGGGAATGTG 
4 651 ACTGTGAGGC CTTTCCCCAG ACCCCAGAGC CTTGACCTTG GGGCTACCTT 
4701 CACAGTGGAT GCCCACCAAT TGGATAACCA GTCCCAGCCT CGTGACCCTG 
4751 GGCCTCAGTC AGCGTTTAGC CTACCAGGGT CCACCCAGCA CCTGCGCTCC 
4801 CAGCTGTCAC AATGCAAACA ACGCTATCAA GATCTCCAGG AGAAGCTGCT 
4851 GCTATCAGAA GCCACTGTCT TTGCTCAGGC TAACGAGCTG GAGAAATACA 
4901 GAGTTATGCT TACAGGTGAA TCCTTGGTGA AGCAGGACAG CAAGCAGATC 
4951 CAGGTGGACC TCCAGGACCT GGGCTATGAG ACTTGTGGCC GAAGCGAGAA 
5001 TGAGGCTGAA CGGGAGGAAA CCACCAGTCC TGAGTGTGAG GAGCACAACA 
5051 GCCTCAAGGA AATGGTCCTG ATGGAGGGGC TGTGCTCTGA GCAGGGACGC 
5101 CGGGGCTCAA CACTGGCTAG TTCCTCTGAG AGGAAGCCCT TGGAGAACCA 
5151 GCTAGGGAAG CAGGAAGAGT TCCGGGTATA TGGAAAGTCA GAAAACATCT 
5201 TGGTCCTACG AAAGGACATC AAAGATCTGA AGGCCCAGCT GCAGAATGCC 
5251 AACAAGGTCA TTCAAAACCT CAAGAGCCGG GTCCGGTCCC TCTCAGTTAC 
5301 AAGTGATTAT TCGTCTAGTC TGGAAAGACC CCGGAAGCTG AGAGCTGTTG 
5351 GCACCTTGGA GGGGTCTTCA CCTCATAGTG TCCCTGATGA GGATGAGGGG 
54C1 TGGCTGTCTG ATGGCAC7GG GGCTTTCTAC TCTCCAGGGC TTCAGGCCAA 
5451 AAAGGACCTG GAGAGTCTCA TCCAGAGAGT ATCCCAGCTG GAGGCCCAGC 
5501 TCCCAAAAAA TGGACTAGAA GAGAAGCTGG CTGAGGAGCT GAGATCAGCC 
55S1 TCGTGGCCTG GGAAATATGA TTCCCTGATT CAGGATCAGG CCCGGGAACT 
5601 GTCTTACCTA CGGCAAAAAA TACGAGAAGG GAGAGGTATT TGTTATCTTA 
5651 TCACCCGGCA TGCAAAAGAT ACAGTAAAAT CTTTTGAGGA TCTCCTAAGG 
57C1 AGCAATGACA TTGACTACTA CCTGGGACAG AGCTTCCGGG AGCAACTCGC 
5751 CCAGGGAAGC CAGCTGACAG AGAGGCTCAC CAGCAAACTC AGCACCAAGG 
56C1 ATCATAAAAG TGAGAAAGAT CAAGCTGGAC TTGAGCCACT GGCCCTCAGG 
53=1 CTCAGCAGGG AGCTGCAGGA GAAGGAGAAA GTGATTGAAG TCCTGCAGGC 
59:*. CAACCTGGAT GCTCGGTCCC TCACACCCTC CAGCAGCCAT GCCTTGTCTG 
5951 ACTCCCACCG CTCTCCCAGC AGCACCTCTT TCCTGTCTGA TGAACTGGAA 
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6001 GCCTGCTCTG ACATGGACAT AGTCAGCGAG TACACACACT ATGAAGAGAA 
6051 GAAAGCTTCT CCCAGTCACT CAGATTCCAT CCATCATTCG AGTCATTCTG 
6101 CTGTGTTGTC TTCTAAACCA TCATCAACCA GTGCATCTCA GGGGGCTAAG 
6151 GCCGAATCCA ACAGCAACCC CATCAGCTTG CCAACTCCCC AGAATACCCC 
6201 CAAGGAGGCC AACCAGGCCC ATTCAGGCTT TCATTTTCAC TCCATACCCA 
6251 AGCTGGCTAG CCTTCCTCAG GCACCATTGC CCTCAGCTCC ATCCAGCTTC 
6301 CTGCCTTTCA GCCCCACTGG CCCTCTCCTC CTTGGCTGCT GTGAGACACC 
6351 AGTGGTCTCC TTGGCTGAGG CTCAGCAGGA GCTACAGATG CTGCAGAAGC 
6401 AGTTGGGAGA AAGTGCCAGC ACTGTTCCTC CTGCTTCCAC AGCTACATTG 
6451 CTGAGCAACG ACTTGGAAGC CGACTC7TCC TACTACCTCA ACTCTGCCCA 
6501. GCCTCACTCT CCTCCAAGGG GCACCATAGA ACTGGGAAGA ATCCTAGAGC 
6551 CTGGGTACCT GGGCAGCAGT GGCAAGTGGG ATGTGATGAG GCCTCAGAAA 
6601 GGGAGTGTAT CTGGGGACCT ATCCTCAGGC TCCTCTGTGT ACCAGCTTAA 
6651 CTCCAAACCC ACAGGGGCTG ACCTGCTGGA AGAGCATCTT GGTGAAATCC 
6701 GGAACCTGCG CCAGCGCCTG GAGGAGTCCA TCTGCATCAA TGACCGCCTA 
6751 CGGGAGCAAC TGGAACACCG GCTGACCTCT ACTGCTCGTG GAAGGGGATC 
6801 CACTTCTAAC TTCTACAGTC AGGGCCTGGA GTCCATACCT CAGCTCTGCA 
6851 ATGAGAACAG AGTCCTCAGG GAAGACAATC GAAGACTTCA GGCTCAACTG 
6901 AGTCATGTTT CCAGAGAGCA CTCCCAGGAA ACAGAAAGCC TGAGGGAGGC 
6951 TCTGCTGTCC TCTCGATCCC ACCTTCAAGA GCTGGAAAAG GAGCTGGAGC 
7001 ACCAGAAGGT GGAAAGGCAG CAGCTTTTGG AAGACTTGAG GGAGAAGCAG 
7051 CAAGAGGTCT TGCATTTCAG GGAGGAACGT CTTTCCCTCC AGGAAAACGA 
7101 CTCCAGTGGG CCTTGCCTCT CCCTGGTCAG ACTGCAGCAC AAGCTGGTTC 
7151 TCCTGCAGCA ACAGTGTGAA GAGAAACAGC AGCTCTTTGA GTCCCTCCAG 
7201 TCAGAGCTAC AAATCTACGA GGCACTTTAT GGCAATTCCA AGAAGGGGCT 
7251 GAAAGCTTAC AGCCTGGATG CCTGTCACCA AATCCCTTTG AGCAGTGACC 
7301 TGAGCCACCT GGTGGCAGAG GTACGAGCTC TGAGAGGGCA GCTGGAGCAG 
7351 AGCATTCAGG GGAACAATTG TCTGCGACTG CAGCTGCAAC AGCAGCTGGA 
74 01 GAGCGGTGCT GGCAAAGCCA GCCTCAGCCC CTCCTCCATT AACCAGAACT 
7451 TCCCAGCCAG CACTGACCCT GGAAACAAGC AGCTGCTCCT CCAAGATTCA 
7501 GCTGTGTCCC CTCCAGTCCG GGATGTTGGT ATGAATTCCC CAGCTCTGGT 
7551 CTTCCCCAGC TCTGCTTCCT CTACTCCTGG CTCAGAAACG CCCATAATCA 
7601 ACAGAGCAAA TGGCTTGGGT TTGGATACTT CTCCAGTAAT GAAGACCCCT 
7651 CCCAAGCTAG AGGGTGATGC TACTGATGGC TCCTTTGCCA ATAAGCATGG 
7701 CCGCCATGTC ATTGGCCACA TTGATGACTA CAGTGCCCTA AGACAGCAGA 
7751 TTGCGGAGGG CAAGCTGCTG GTCAAAAAGA TAGTGTCTCT TGTGAGATCA 
7801 GCGTGCAGCT TCCCTGGCCT TGAAGCCCAA GGCACAGAGG TGCTAGGCAG 
7851 CAAAGGTATT CATGAGCTTC GGAGCAGCAC CAGTGCCCTG CACCATGCCC 
7901 TAGAGGAGTC GGCTTCCCTC CTCACCATGT TCTGGAGAGC AGCCCTGCCA 
7951 AGCACCCACA TCCCTGTGCT GCCTGGCAAA GTGGGAGAAT CAACAGAAAG 
8001 GGAACTTCTG GAACTGAGAA CCAAAGTATC CAAACAGGAG CGGCTCCTTC 
8051 AGAGCACAAC TGAGCATCTG AAGAACGCCA ACCAGCAGAA GGAGAGCATG 
8101 GAGCAGTTCA TCGTCAGCCA GCTAACCAGA ACACATGATG TTTTAAAGAA 
8151 GGCAAGGACT AACTTAGAGG TGAAATCCCT AAGGGCTCTG CCATGTACTC 
8201 CAGCCTTGTG ACCCTTGCCT TCCAGGAACC ATGCAAGAAG CGCAGCCACC 
8251 AGAAGTCCTT AAAACAGCAG GAAAGGTGGG CCTGTCCCCC TTTTGTGCAG 
8301 CTACCTATCT GCTGAGGAGC ATCTGGGCCT CATTCCTCCA AGTCCACGGG 
8351 AGGGTCCAGA AGAGGGAGTC AGAGATGTAT CCTGGTGGAG CTGGGAGAAA 
8401 GGCAGAAAGC CTTTCTGACA GCTATGGAAT ACGATTAGCC AAGGTCCAGT 
8451 TGGCCCAGCA CTAAGAAAAA GATGCGTAGT TTGCACAGAA GGTTTTGTGA 
8501 TCCTGCCTCT CAACAGCCCC AGCAGCTTGG GAACTAGCAA GAGCACATTT 
8551 CTTGCCTCAT CAGCTGTCCT GAGATGGAAA ACTCAGTGGA TATAGGACCC 
8601 TGATTCCGAT GAAAGGGGCA CGTGGTCCCA ATGCTGGAGC TCCTCTGGCA 
8651 GGTTCTAAAA GCACACTACT GAGCAGCGGT GCCCTGCCGG ACACTGCTGG 
8701 CGGGGGCTCA GTGAGCACTA CTCACAGATC CACACCTGAC CCTGTTGGGT 
8751 CGAGTCAGGC TGGGCCTTGG TCTGCACTGT AGCACCTGTG TTCTTTGAGT 
8801 TCACATCATG AATGTGGTGA CTTCCCAGAT ACCATCTCAG GCTTAACCTA 
8351 GCACA7CCTA TTTCTTTTCT 7CTATGATA? CCAAATT3GA CTGACCTCAC 
8901 TTCAAAGTTG CTGTCCCATT TTGTCACCCT ATCTTATCTC GGGGAAATTG 
8951 CAGACTGATG GCCAGACCAA CTCTGTTGAA AT^'CTTGCAT AGAGCAAACC 
9001 TGTGCTCATT TTTAAGTGGC ATGGGAGAGG CCCCCAGCCT AGTAAAGCCT 
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9051 AGTCTGTGTC TTCACAGTGC TGGTAGAATG TGTTTGTGTG TATAAATATA 

9101 T GAT AT AG AT TTATATATGT TGCTAACGCC ATATATTGAA GGCCAACATA 

9151 ACTGGTGGAC AGGGTGGGTG ACAGAAAATG AAAGCCTTTT TGGTGATTGT 

9201 TAAAGCAAGA TGTGTATAAA GAAATAAATA GTTTTTCTTT C (SZQ ID NO: 04) 
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>Human myomegalin protein 

1 MKEICRICAR ELCGNQRRWI FHTASKLNLQ VLLSHVLGKD VPRDGKAEFA 
51 CSKCAFMLDR IYRFDTVIAR IEALSIERLQ KLLLEKDRLK FCIASMYRKN 
101 NDDSGAEIKA GNGTVDMSVL PDARYSALLQ EDFAYSGFEC WVENEDQIQE 
151 PHSCHGSEGP GNRPRRCRGC AALRVADSDY EAICKVPRKV ARSISCGPSS 
201 RWSTSICTEE PALSEVGPPD LASTKVPPDG ESMEEETPGS SVESLDASVQ 
251 ASPPQQKDEE TERSAKELGK CDCCSDDQAP QHGCNHKLEL ALSMIKGLDY 
301 KPIQSPRGSR LPIPVKSSLP GAKPGPSMTD GVSSGFLNRS LKPLYKTPVS 
351 YPLELSDLQE LWDDLCEDYL PLRVQPMTEE LLKQQKLNSH ETTITQQSVS 
401 DSHLAELQEK IQQTEATNKI LQEKLNEMSY ELKCAQESSQ KQDGTIQNLK 
4 51 ETLKSRERET EELYQVIEGQ NDTMAKLREM LHQSQLGQLH SSEGTSPAQQ 
501 QVALLDLQSA LFCSQLEIQK LQRWRQKER QLADAKQCVQ FVEAAAHESE 
551 QQKEASWKHN QELRKALQQL QEELQNKSQQ LRAWEAEKYN EIRTQEQNIQ 
601 HLNHSLSHKE QLLQEFRELL QYRDNSDKTL EANEMLLEKL RQRIHDKAVA 
651 LERAIDEKFS ALEEKEKELR QLRLAVRERD HDLERLRDVL SSNEATMQSM 
701 ESLLRAKGLE VEQLSTTCON LQWLKEEMET KFSRWQKEQE SIIQQLQTSL 
7 51 HDRNKEVEDL SATLLCKLGP GQSEIAEELC QRLQRKERML QDLLSDRNKQ 
801 VLEHEMEIQG LLQSVSTREQ ESQAAAEKLV QALMERNSEL QALRQYLGGR 
851 DSLMSQAPIS NQQAEVTPTG RLGKQTDQGS MQIPSRDDST SLTAKEDVSI 
901 PRSTLGDLDT VAGLEKELSN AKEELELMAK KERESQMELS ALQSMMAVQE 
951 EELQVQAADM ESLTRNIQIK EDLIKDLQMQ LVDPEDIPAM ERLTQEVLLL 
1001 REKVASVESQ GQEISGNRRQ QLLLMLEGLV DERSRLNEAL QAERQLYSSL 
1051 VKFHAHPESS ERDRTLQVEL EGAQVLRSRL EEVLGRSLER LNRLETLAAI 
1101 GGAAAGDDTE OTSTEFTDSI EEEAAHHSHQ QLVKVALEKS LATVETQNPS 
1151 FSPPSPMGGD SNRCLQEEML HLRAEFHQHL EEKRKAEEEL KELKAQIEEA 
1201 GFSSVSHIRN TMLSLCLENA ELKEQMGEAM SDGWEIEEDK EKGEVMVETV 
1251 VTKEGLSESS LQAEFRKLQG KLKNAHNIIN LLKEQLVLSS KEGNSKLTPE 
1301 LLVHLTSTIE RINTELVGSP GKHQHQEEGN VTVRPFPRPQ SLDLGATFTV 
1351 DAHQLDNQSQ PRDPGPQSAF SLPGSTQHLR SQLSQCKQRY QDLQEKLLLS 
1401 EATVFAQANE LEKYRVMLTG ESLVKQDSKQ IQVDLQDLGY ETCGRSENEA 
1451 EREETTSPEC EEHNSLKEMV LMEGLCSEQG RRGSTLASSS ERKPLENQLG 
1501 KQEEFRVYGK SENILVLRKD IKDLKAQLQN ANKVIQNLKS RVRSLSVTSD 
1551 YSSSLERPRK LRAVGTLEGS SPHSVPDEDE GWLSDGTGAF YSPGLQAKKD 
1601 LESLIQRVSQ LEAQLPKNGL EEKLAEELRS ASWPGKYDSL IQDQARELSY 
1651 LRQKIREGRG ICYLITRHAK DTVKSFEDLL RSNDIDYYLG QSFREQLAQG 
1701 SQLTERLTSK LSTKDHKSEK DQAGLEPLAL RLSRELQEKE KVIEVLQAKL 
1751 DARSLTPSSS HALSDSHRSP SSTSFLSDEL EACSDMDIVS EYTHYEEKKA 
1801 SPSHSDSIHH SSHSAVLSSK PSSTSASQGA KAESNSNPIS LPTPQNTPKE 
1851 ANQAHSGFHF HSIPKLASLP QAPLPSAPSS FLPFSPTGPL LLGCCETPW 
1901 SLAEAQQELQ MLQKQLGESA STVPPASTAT LLSNDLEADS SYYLNSAQPH 
1951 SPPRGTIELG RILEPGYLGS SGKWDVMRPQ KGSVSGDLSS GSSVYQLNSK 
2001 PTGADLLEEH LGEIRNLRQR LEESICINDR LREQLEHRLT STARGRGSTS 
2051 NFYSQGLESI PQLCNENRVL REDNRRLQAQ LSHVSREHSQ ETESLREALL 
2101 SSRSHLQELE KELEHQKVER QQLLEDLREK QQEVLHFREE RLSLQENDSS 
2151 GPCLSLVRLQ HKLVLLQQQC EEKQQLFESL QSELQIYEAL YGNSKKGLKA 
2201 YSLDACHQIP LSSDLSHLVA EVRALRGQLE QSIQGNNCLR LQLQQQLESG 
2251 AGKASLSPSS INQNFPASTD PGNKQLLLQD SAVSPPVRDV GMNSPALVFP 
2301 SSASSTPGSE TPIINRAKGL GLOTSPVMKT PPKLEGDATD GSFANKHGRH 
2351 VIGHIDDYSA LRQQIAEGKL LVKKIVSLVR SACSFPGLEA QGTEVLGSKG 
2401 IHELRSSTSA LHHALEESAS LLTMFWRAAL PSTHIPVLPG KVGESTEREL 
24 51 LELRTKVSKQ ERLLQSTTEH LKNANQQKES MEQFIVSQLT RTHDVLECKAR 
2501 TNLEVKSLRA LPCTPAL (SEQ ID NO: 05) 
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FIGURE 6 

M14 PROTEIN 

MMAQFPTAMNGGPNMWAITSEERTKHDKQE'DNLKPSGGYITGDQARTFFLQSGLPAPVL 

AE I WALS DLNKDGKMDQQE FS I AMKL IKLKLQGQQLP VVL P P IMKQ P PM FS PLI SARFG 

MGSMPNLSIHQPLPPVAPITAPLSSATSGTSIPPLMMPAPLVPSVSTSSLPNGTASLIQ 

PLSIPYSSSTLPHASSYSLMMGGFGGASIQKAQSLIDLGSSSSTSSTASLSGNSPKTGT 

SEWAVPQPSRLKYRQKFNSLDKSMSGYLSGFQARNALLQSNLSQTQLATIWTLADIDGD 

GQLKAEEFILAMHLTDMAKAGQPLPLTLPPELVPPSFRGGKQIDSINGTLPSYQKTQEE 

EPQKKL PVTFEDKRKANYERGNMELEKRRQVLMEQQQREAERKAQKEKEEWERKQRELQ 

EQEWKKQLELEKRLEKQRELERQREEERRKEIERREAAKQELERQRRLEWERIRRQELL 

NQKNREQEEIVRLNSKKKSLHLELEAVNGKHQQISGRLQDVRIRKQTQKTELEVLDKQC 

DLEIMEIKQLQQELQEYQNKLIYLVPEKQLLNERIKNMQLSNTPDSGISLLHKKSSEKE 

ELCQRLKEQLDALEKETASKLSEMDSFNNQLKCGNMDDSVLQCLLSLLSCLNNLFLLLK 

ELRESYNTQQLALEQLHKIKRDKLKELERKRLEQIQKKKLEDEAARKAKQGKENLWKES 

IRKEEEEKQKRLQEEKSQDRTQEEERKTEAKQSETARALVNYRALYPFEARNHDEMSFN 

SGDIIQVDEKTVGEPGWLYGSFQGKFGWFPCNYVEKMLSSDKTPSPKKALLPPAVSLSA 

TSAAPQPLCSNQPAPVTDYQNVSFSNLNVNTTWQQKSAFTRTVSPGSVSPIHGQGQAVE 

NLKAQALCSWTAKKENHLNFSKHDVITVLEQQENWWFGEVHGGRGWFPKSYVKIIPGSE 

VKRGEPEALYAAVNKKPTSTAYPVGEEYIALYSYSSVEPGDLTFTEGEELLVTQKDGEW 

WTGSIGERTGIFPSNYVRPKDQENVGNASKSGASNKKPEIAQVTSAYAASGAEQLSLAP 

GQLILILKKNSSGWWQGELQARGKKRQKGWFPASHVKLLGPSAERTTPAFHAVCQVIAM 

YDYIANNEDELNFSKGQLINVMNKDDPDWWQGEINGVTGLFPSNYVBCMTTDSDPSQQWC 

ADLQALDTMQPMERKRQGYIHELIETEERYMDDLQLVIEVFQKRMAESGFLTEAEMALI 

FVNWKELIMSNTKLLKALRVRKKTGGEKMPVEMMGDILAAELSHMQAYIRFCSCQLNGA 

ALLQQKTDEDADFKEFLKKLASDPRCKGMPLSSFLLKPMQRITRYPLLIRSILENTPQN 

HVDHSSLKLALERAEELCSQVNEGVREKENSDRLEWIQAHVQCEGLAEQLIFNSLTNCL 

GPRKLLYSGKLYKTKSNKELHGFLFNDFLLLTYLVRQFAASSGFEKLFSSKSSAQFKMY 

KTPIFLNEVLVKLPTDPSSDEPVFHISHIDRVYTLRTDNINERTAWVQKIKAASEQYID 

TEKKKREKAYQARSQKTSGIGRLMVHVIEATELKACKPNGKSNPYCEISMGSQSYTTRT 

LQDTLNPKWNFNCQFFIKDLYQDVLCLTMFDRDQFSPDDFLGRTEVPVAKIRTEQESKG 

PTTRRLLLHEVPTGEVWVRFDLQLFEQKTLL {SEQ ID NO: 08) 
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SEQUENCE LISTING 



<110> Conti, Marco 

Pahlke, Gudrun 



<120> Novel Phosphodiesterase Interacting 
Proteins 

<130> SUN-101PCT 

<140> 60/108,255 
<141> 1998-11-12 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 2326 
<212> PRT 
<213> rat 



<4 00> 1 
Met Ser 
1 

Lys Glu 

Met Gin 

He Glu 

50 
Arg Lys 
65 

Ser Gin 

Glu Thr 

Gin Glu 

Leu Val 
130 
Lys Asp 
145 

Leu Asp 

Glu Leu 

Ser Arg 

Met Glu 
210 
Asp Leu 
225 

Leu Ala 



Asn Gly 

Asn Phe 

20 
Gin Lys 
35 

Leu Lys 

Gin His 

Asn Glu 

Glu His 
100 
Glu Ser 
115 

Glu Ala 

Ala Thr 

Gin Tyr 

Arg Gin 
180 
Glu Lys 
195 

Val Gin 



Tyr Arg Thr 
5 

Ser Leu Lys 

Tyr Glu Val 

Val Glu Val 
55 

Leu His Lys 
70 

Ala Glu Leu 
85 

Val Tyr Glu 



Leu Ser 

Leu Arg 

25 
Ser Arg 
40 

Glu Ser 
Thr Trp 
Arg Arg 



Leu Leu 
105 

Arg Phe Ala Lys Asp 
120 

Cys Asn 



Asn Glu 
Glu Leu 



He Leu 

Gin Glu 

Met Leu 
290 
Glu Gly 



Glu Lys Gly 
135 

Lys Asn Arg 

150 
Ser Ala Ala 
165 

Ser Leu Ala 

Gin Gin Leu 

Pro Met Pro 
215 

Thr Pro Thr 

230 
Gin Asp Lys 
245 

Lys Leu Asn 



Gin Glu 
260 

Ser Ser Gin Lys Gin 
275 

Lys Ser 



Gin Asn 



Arg Glu Ser 
295 

Asp Thr Met 



Glu Asp 

Leu Ala 

Ala Gin 
185 
Leu His 
200 

Lys Gly 

Thr Gin 

He Gin 

Asp Mqt 
265 
Asp Thr 
280 

Glu Thr 



Gin His 
10 

He Tyr 

Glu Asp 

Leu Lys 

Ala Asp 

75 
Gin Val 
90 

Asp Asn 

Glu Ala 

Leu Glu 

Ala Pro 
155 
Gin Arg 
170 

Glu Gly 
Leu Leu 
Leu Pro 



Leu Asn 
Phe Leu 



Asp Leu Lys 
15 

Glu Glu Arg 
30 

Lys Arg Asn 



Val Tyr 
45 

Arg Glu Leu Gin Asp 
60 

Glu Glu 



Glu Glu 

Asn He 

Thr Gin 
125 
Leu Ser 
140 

Gly Asp 
Asp Arg 
Leu Val 



Asp Leu Asn 
80 

Pro Gin Gin 
95 

Gin Leu Leu 
110 

Met Glu Thr 

Glu Arg Trp 

Gin Val Lys 
160 

Arg He Glu 

175 
Glu Gin Leu 
190 

Pro Gly Gly 



Pro Ser 
235 
Gin Thr 
250 

Ser Cys 



Glu Glu 
205 

Thr Gin Gin Lys Pro 
220 

Val Ser 



Thr He 
Glu Glu 
Ala Lys Leu Pro 



Glu Val 

Glu Leu 

Gin Ser 
285 
Leu Tyr 
300 

Glu Met 



Asp Ser His 
240 

Thr Asn Lys 

255 
Arg Ser Ala 
270 

Leu Lys Glu 
Gin Val He 
Leu His Gin 
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305 310 315 

Ser Gin Leu Gly Gin Leu Gin Ser Ser Glu Gly He Ala Pro Ala Gin 

330 33c 
Gin Gin Val Ala Leu Leu Asp Leu Gin Ser Ala Leu Phe Cys Ser Gin 

345 

Leu Glu lie Gin Lys Leu Gin Arg Leu Leu Arg Gin Lys Glu Arg Gin 

360 355 
Leu Ala Asp Gly Lys Arg Cys Met Gin Phe Val Glu Ala Ala Ala Gin 

375 380 
Glu Arg Glu Gin Gin Lys Glu Ala Ala Trp Lys His Asn Gin Glu Leu 
85 390 395 400 

Gin Leu His Val Leu Glu Ala Glu Lys Zyz Asn Glu lie Arg Thr Gin 



Arg Lys Ala Leu Gin His Leu Gin Gly Glu Leu His Ser Lys Ser Gin 
405 4xo 



420 425 430 

Gly Gin Asn He Gin His Leu Ser His Ser Leu Ser His Lys Glu Gin 

4J5 440 445 

Leu lie Gin Glu Leu Gin Glu Leu Leu Gin Tyr Arg Asp Thr Thr Asp 

* 3U 455 460 

Lys Thr Leu Asp Thr Asn Glu Val Phe Leu Glu Lys Leu Arg Gin Ara 
470 475 4 * 

lie Gin Asp Arg Ala Val Ala Leu Glu Arg Val He Asp Glu Lys Phe 

485 490 49c 

Ser Ala Leu Glu Glu Lys Asp Lys Glu Leu Arg Gin Leu Arg Leu Ala 

500 505 51 5 

Val Arg Asp Arg Asp His Asp Leu Glu Arg Leu Arg Cys Val Leu Ser 

515 520 525 

Ala Asn Glu Ala Thr Met Gin Ser Met Glu Ser Leu Leu Arg Ala Arg 
535 540 



Gly Leu Glu Val Glu Gin Leu He Ala Thr Cys Gin Asn Leu Gin Trp 
->*o 550 555 r 

Leu Lys Glu Glu Leu Glu Thr Lys Phe Gly His 

565 570 " 575 

Glu Ser He lie Gin Gin Leu Gin Thr Ser Leu His Asp Arg Asn Lys 

580 585 590 

Glu Val Glu Asp Leu Ser Ala Thr Leu Leu His Lys Leu Gly Pro Gly 

393 600 605 

Gin Ser Glu Val Ala Glu Glu Leu Cys Gin Arg Leu Gin Arg Lys Glu 

610 615 6 20 

Arg Val Leu Gin Asp Leu Leu Ser Asp Arg Asn Lys Gin Ala Met Glu 
630 635 640 

His Glu Met Glu Val Gin Gly Leu Leu Gin Ser Met Gly Thr Arg Glu 

645 650 655 

Gin Glu Arg Gin Ala Val Ala Glu Lys Met Val Gin Ala Phe Met Glu 

660 665 670 

Arg Asn Ser Glu Leu Gin Ala Leu Arg Gin Tyr Leu Gly Gly Lys Glu 

675 680 685 

Leu Met Ala Ala Ser Gin Ala Phe He Ser Asn Gin Pro Ala Gly Ala 

690 695 700 

Thr Ser Val Gly Pro His His Gly Glu Gin Thr Asp Gin Gly Ser Thr 

710 715 
Gin Met Pro Ser Arg Asp Asp Ser Thr Ser Leu Thr Ala Arg Glu Glu 

725 730 735 

Ala Ser He Pro Arg Ser Thr Leu Gly Asp Ser Asp Thr Val Ala Gly 
740 745 750 1 

Leu Glu Lys Glu Leu Ser Asn Ala Lys Glu Glu Leu Glu Leu Met Ala 

755 760 . 765 

Lys Lys Glu Arg Glu Ser Gin He Glu Leu Ser Ala Leu Gin Ser Met 

"° 775 780 

Met Ala Val Gin Glu Glu Glu Leu Gin Val Gin Ala Ala Asp Leu Glu 
790 795 800 

Ser Leu Thr Arg Asn He Gin He Lys Glu Asp Leu He Lys Asp Leu 
8 °5 810 81 5 



-2- 



WO 00/27861 



PCT/US99/26860 



Gin Met Gin Leu Val Asp Pro Glu Asp Met Pro Ala Met Glu Arg Leu 

820 825 830 

Thr Gin Glu Val Leu Leu Leu Arg Glu Lys Val Ala Ser Val Glu Pro 

835 840 845 

Gin Gly Gin Glu Gly Ser Glu Asn Arg Arg Gin Gin Leu Leu Leu Met 

850 855 860 

Leu Glu Gly Leu Val Asp Glu Arg Ser Arg Leu Asn Glu Ala Leu Gin 
865 870 875 880 

Ala Glu Arg Gin Leu Tyr Ser Ser Leu Val Lys Phe His Ala Gin Pro 

885 890 895 

Glu He Ser Glu Arg Asp Arg Thr Leu Gin Val Glu Leu Glu Gly Ala 

900 905 910 

Gin Val Leu Arg Ser Arg Leu Glu Glu Val Leu Gly Arg Ser Leu Glu 

915 920 925 

Arg Leu Ser Arg Leu Glu Thr Leu Ala Ala He Gly Gly Ala Thr Ala 

930 935 940 

Gly +.sp Glu Thr Glu Asp Thr Ser Thr Gin Phe Thr Asp Ser He Glu 
945 950 955 9 6 o 

Glu Glu Ala Ala His Asn Ser His Gin Gin Leu He Lys Val Ser Leu 

965 970 975 

Glu Lys Ser Leu Thr Thr Met Glu Thr Gin Asn Thr Cys Leu Gin Pro 

980 985 990 

Pro Ser Pro Val Gly Glu Asp Gly Asn Arg His Leu Gin Glu Glu Met 

995 1000 1005 

Leu His Leu Arg Ala Glu He His Gin Pro Leu Glu Glu Lys Ara Lvs 

1010 1015 1020 

Ala Glu Ala Glu Leu Lys Glu Leu Lys Ala Gin He Glu Glu Ala Glv 
1025 1030 1035 loJo 

Phe Ser Ser Val Ser His He Arg Asn Thr Met Leu Ser Leu Cys Leu 

1045 1050 1055 

Cys Leu Glu Asn Ala Glu Leu Lys Glu Gin Met Gly Glu Ala Met Ser 

1060 1065 1070 

Asp Gly Trp Glu Val Glu Glu Asp Lys Glu Lys Gly Glu Val Met Val 

1075 1080 1085 

Glu Thr Val Val Ala Lys Gly Gly Leu Ser Glu Asp Ser Leu Gin Ala 

1090 1095 HOO 

Glu Phe Arg Lys Val Gin Gly Arg Leu Lys Ser Ala Tyr Asn He He 
J 105 U10 1115 H20 

Asn Leu Leu Lys Glu Gin Leu Val Leu Arg Ser Ser Glu Gly Asn Thr 

1125 H30 H35 

Lys Glu Met Pro Glu Phe Leu Val Arg Leu Ala Arg Glu Val Asp Arg 

1140 H45 H50 

Met Asn Met Gly Leu Pro Ser Ser Glu Lys His Gin His Gin Glu Gin 

1155 H60 H65 

Glu Asn Met Thr Ala Arg Pro Gly Pro Arg Pro Gin Ser Leu Lys Leu 

1170 H75 H80 

Gly Thr Ala Leu Ser Val Asp Gly Tyr Gin Leu Glu Asn Lys Ser Gin 
U85 H90 H95 120Q 

Ala Gin Asp Ser Gly His Gin Pro Glu Phe Ser Leu Pro Gly Ser Thr 

1205 1210 1215 

Lys His Leu Arg Ser Gin Leu Ala Gin Cys Arg Gin Arg Tyr Gin Asp 

1220 1225 1230 

Leu Gin Glu Lys Leu Leu He Ser Glu Ala Thr Val Phe Ala Gin Ala 

1235 1240 1245 

Asn Gin Leu Glu Lys Tyr Arg Ala He Leu Ser Glu Ser Leu Val Lys 

1250 1255 1260 

Gin Asp Ser Lys Gin He Gin Val Asp Leu Gin Asp Leu Gly Tyr Glu 
1265 1270 1275 1280 

Thr Cys Gly Arg Ser Glu Asn Glu Ala Glu Arg Glu Glu Thr Thr Ser 

1285 • 1290 1295 

Pro Glu Cys Glu Glu His Gly Asn Leu Lys Pro Val Val Leu Val Glu 

1300 1305 1310 

Gly Leu Cys Ser Glu Gin Gly Tyr Leu Asp Pro Val Leu Val Ser Ser 



WO 00/27861 



PCT/US99/26860 



13X5 1320 1325 

Pro Val Lys Asn Pro Trp Arg Thr Ser Gin Glu Ala Arg Arg lie Gin 

1330 1335 134Q 

Ala Gin Gly Thr Ser Asp Asn Ser Ser Leu Leu Arg Lys Asp lie Arg 

I 1350 135 5 1360 

Asn Leu Lys Ala Gin Leu Pro Asn Ala Tyr Lys Val Leu Gin Asn Leu 

1365 1370 1375 

Arg Ser Arg Val Arg Ser Leu Ser Ala Thr Ser Asp Tyr Ser Ser Ser 

1380 1385 1390 

Leu Glu Arg Pro Arg Lys Leu He Ala Val Ala Thr Leu Glu Glv Ala 
1395 1400 i 4 05 

Sfir f^?n HiS Val Thr Glu Glu Glv Leu Leu SG * Asp Gly 

1410 1415 1420 

Thr Gly Ala Phe Tyr Pro Pro Gly Leu Gin Ala Lys Lys Asn Leu Glu 
; 1430 1435 i44 0 

Asn Leu lie Gin Arg Val Ser Gin Leu Glu Ala Gin Leu Pro Lys Thr 

1445 1450 ^55 

Gly Leu Glu Gly Lys Leu Ala Glu Glu Leu Lys Ser Ala Ser Trp Pro 

1460 1465 1470 * 

Gly Lys Tyr Asp Ser Leu He Gin Asp Gin Ala Arg Lys Thr Val He 

1475 1480 1485 

Ser Ala Ser Glu Asn Thr Lys Arg Glu Lys Asp Leu Phe Ser Ser His 

!490 1495 isoo 

Pro Thr Phe Glu Arg Tyr Val Lys Ser Phe Glu Asp Leu Leu Arg Asn 
J 1510 1515 1520 

Asn Asp Leu Thr Thr Tyr Leu Gly Gin Ser Phe Arg Glu Gin Leu Ser 

1525 1530 153S 

Ser Arg Arg Ser Val Thr Asp Arg Leu Thr Ser Lys Phe Ser Thr Lvs 

1540 1545 1550 

Asp His Lys Ser Glu Lys Glu Glu Val Gly Leu Glu Pro Leu Ala Phe 

1555 1560 1565 

Arg Phe Ser Arg Glu Leu Gin Glu Lys Glu Lys Val He Glu Val Leu 

lf70 1575 1580 

Gin Ala Lys Val Asp Thr Arg Phe Phe Ser Pro Pro Ser Ser His Ala 

85 I 590 1595 isoo 

Ala Ser Glu Ser His Arg Cys Ala Ser Ser Thr Ser Phe Leu Ser Asn 

1605 1610 1615 

Asp He Glu Ala Cys Ser Asp Met Asp Val Ala Ser Glu Tyr Thr His 

16 20 1625 1630 

Tyr Glu Glu Lys Lys Pro Ser Pro Ser Asn Ser Ala Ala Ser Ala Ser 

1635 1640 1645 

Gin Gly Leu Lys Gly Glu Pro Arg Ser Ser Ser He Ser Leu Pro Thr 

1650 1655 1660 

Pro Gin Asn Pro Pro Lys Glu Ala Ser Gin Ala Gin Pro Gly Phe His 

Z 1670 1675 1680 

Phe Asn Ser He Pro Lys Pro Ala Ser Leu Ser Gin Ala Pro Met His 

1685 1690 1695 

Phe Thr Val Pro Ser Phe Met Pro Phe Gly Pro Ser Gly Pro Pro Leu 

1700 1705 1710 

Leu Gly Cys Cys Glu Thr Pro Val Val Ser Leu Ala Glu Ala Gin Gin 

1715 1720 1725 

Glu Leu Gin Met Leu Gin Lys Gin Leu Gly Arg Ser Val Ser He Ala 

!730 1735 1740 

Pro Pro Thr Ser Thr Ser Thr Leu Leu Ser Asn His Thr Glu Ala Ser 
1745 1750 1755 1760 

Ser Pro Arg Tyr Ser Asn Pro Ala Gin Pro His Ser Pro Ala Arg Glv 

1765 1770 1775 

Thr He Glu Leu Gly Arg He Leu Glu Pro Gly Tyr Leu Gly Ser Glv 

1780 1785 1790 

Gin Trp Asp Met Met Arg Pro Gin Lys Gly Ser He Ser Gly Glu Leu 

1795 1800 1805 

Ser Ser Gly Ser Ser Met Tyr Gin Leu Asn Ser Lys Pro Thr Glv Ala 
18 1° 1815 1820 
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Asp Leu Leu Glu Glu- His Leu Gly Glu lie Arg Asn Leu Arg Gin Arg 

1930 1835 ioJn 

Leu Glu Glu Ser lie Cys Val Asn Asp Arg Leu Arg Glu Gin Leu Gin 

1845 1850 1855 

His Arg Leu Ser Ser Thr Ala Arg Glu Asn Gly Ser Thr Ser His Phe 

I860 1865 i87o 

Tyr Ser Gin Gly Leu Glu Ser Met Pro Gin Leu Tyr Asn Glu Asn Arg 
1875 1880 1885 

^o. Arg GlU GiU Asn Gln Ser Leu Gln Thr Arg leu Ser His Ala 
1890 1895 1900 

Ser Arg Gly His Ser Gln Glu Val Asp His Leu Arg Glu Ala Leu Leu 
„, ? 91 ° «" i 92 o 



« « 1910 i9i 5 192 

Ser Ser Ser Ser Gln Leu Gln Glu Leu Glu Lys Glu Leu Glu Gln Gln 
1925 1930 1935 

Asp Leu Gln Glu 

5 

Leu Ser Leu Gln ■ 
xy^D i960 1965 

SeX ?f?n LSU Gln HiS LyS LeU Ma Leu Leu Gln Gln Gln c ys Glu Glu 
1970 1975 198Q 



1930 1935 
Lys Ala Glu Arg Arg Gln Leu Leu Glu Asp Leu Gln Glu Lys Gln Asp 

1940 1945 1950 

Glu He Val His Phe Arg Glu Glu Arg Leu Ser Leu Gln Glu Asn Asn 
1955 i960 iQc<; 



19/0 1975 1980 

Lys Gln Gln Leu Ser Leu Ser Leu Gln Ser Glu Leu Gln He Tyr Glu 
I 965 1990 1995 2000 

^er Leu Tyr Glu Asn Pro Lys Lys Gly Leu Lys Ala Phe Ser Leu Asp 

2005 2010 2015 

Ser Cys Tyr Gln Val Pro Gly Glu Leu Ser Cys Leu Val Ala Glu He 

2020 2025 2030 

Arg Ala Leu Arg Val Gln Leu Glu Gln Ser He Gln Val Asn Asn Arg 
2035 2040 2045 



^UJS 2040 2045 

^? rt LeU Gln LCU G1U Gln Gln Met His 61 V Ala Gly Lys Ala 

2050 2055 2060 

Ser Leu Ser Ser Cys Pro Val Asn Gln Ser Phe Ser Ala Lys Ala Glu 
2065 2070 2075 2080 

Leu Ala Asn Gln Gln Pro Pro Phe Gln Gly Ser Ala Ala Ser Pro Pro 

2085 2090 2095 

Val Arg Asp Val Gly Leu Asn Ser Pro Pro Val Val Leu Pro Ser Asn 

2 1°° 2105 2110 

Ser Cys Ser Val Pro Gly Ser Asp Ser Ala He He Ser Arg Thr Asn 
. 2115 2120 2125 

n Ser Glu Ser Ala Ala T hr Lys Thr Pro Pro Lys Met Glu 

2130 2135 2140 

Val Asp Ala Ala Asp Gly Pro Phe Ala Ser Gly His Gly Arg His Val 
2145 2150 2155 2160 

He Gly His Val Asp Asp Tyr Asp Ala Leu Gln Gln Gln He Gly Glu 

2165 2170 2175 

Gly Lys Leu Leu He Gln Lys He Leu Ser Leu Thr Arg Pro Ala Aro 

2180 2185 2190 

Ser Val Pro Ala Leu Asp Ala Gln Gly Thr Glu Ala Pro Gly Thr Lys 
2195 2200 2205 

SCr ^o HiS G1U LeU Arg Ser Ser Ma ^9 A 13 Le " Asn His Ser Leu 

_ 2210 2215 2220 

Glu Glu Ser Ala Ser Leu Leu Thr Met Phe Trp Arg Ala Ala Leu Pro 
2225 223 ° 2235 2240 

Asn Ser His Gly Ser Val Leu Val Gly Glu Glu Gly Asn Leu Met Glu 

2245 2250 2255 

Lys Glu Leu Leu Asp Leu Arg Ala Gln Val Ser Gln Gln Gln Gln Leu 

2260 2265 2270 

Leu Gln Ser Thr Ala Val Arg Leu Lys Thr Ala Asn Gln Arg Lys Lys 
2275 2280 2285 

SSr o™ Glu Gln Phe Iie Val Ser His Leu Thr Arg Thr His Asp Val 

229 0 2295 2300 

Leu Lys Lys Ala Arg Thr Asn Leu Glu Met Lys Ser Phe Arg Ala Leu 
2305 23 10 2315 2320 

Met Cys Thr Pro Ala Leu 
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2325 



<210> 2 

<211> 9679 

<212> DNA 

<213> rat 

<220> 

<221> misc_f eature 
<222> (1) . . . (9679) 
<223> n = A,T,C or G 



<400> 2 
ccggtcccct 
gtttgttctt 
tgcccttttc 
gatctaaaag 
tgactcctaa 
caactttctt 
ccattggttc 
tcatcgcagc 
gaagcagggc 
ctgatgcttc 
acgtcgccat 
agcggcgaca 
acactcacgc 
cggcggccgg 
ggcggcggcc 
gcgccggcag 
gggacgcggg 
atgtcccctc 
gactggggct 
tttctactcc 
ccatcgcgtc 
tgattgccag 
agtccagtct 
tccttcaccg 
ctgctctccc 
agccgccgca 
gggccaccgc 
agccctcggg 
ggctgcaccg 
ccaccatgtc 
acttcagcct 
tcagccggga 
aacgagagct 
tcaacagcca 
aacacgttta 
caaaggatga 
agctctcaga 
tgaagcttga 
ggcagagctt 
tgttacatct 
ccacgcaaca 
cccacctggc 
aagagaaact 
aagatacgac 
agctgtacca 
accagagcca 
tggccctgct 
agaggctgtt 
tggaggctgc 
aattacgaaa 



ttggtagtag 
atgctttccc 
tctctgagtc 
aaactcccac 
ctcaggagac 
ctttttcatt 
agactcctct 
atggataaaa 
ttcagcagct 
cacggcccgg 
gtttatcccg 
ggggagaagg 
actcgcacac 
cccgcctccc 
ccggctggca 
ccagcggccg 
cctcggcagg 
agcatggtct 
cctttccagg 
attgctggaa 
ttccctgagc 
ggttatcggt 
tctaaagatg 
tttccacact 
tccacgtcgc 
gccgcccgcc 
gcgagccgct 
gtcccacagc 
tgttccaggg 
taatggatat 
caagctgcgc 
ggacgtctac 
ccaggacagg 
gaatgaagca 
tgagctccta 
agccacacag 
gaggtggaag 
ccaatattct 
ggctgcccag 
gctggaggag 
aaagccagac 
agaactccag 
gaatgacatg 
aatccaaagc 
ggtgattgaa 
gctcggacag 
tgaccttcag 
acgccagaaa 
agcacaggag 
agctttgcaa 



tatctcagag 
cagagcttcg 
catttaatat 
agataggttg 
ccattggaaa 
ggttcgtggc 
gttgacacct 
tcggattctt 
tcaatggtct 
atgagcagat 
gggctggaac 
cggaacccgc 
actccgacgc 
gccgccgggc 
acgcgatcct 
cacggacacc 
aaaaggcgcg 
tcctcctggc 
tccccagttt 
agctccagaa 
aagtttcgaa 
ttcatgttct 
attaaccaga 
cccaagagct 
cgtgtccctt 
gttggcccgg 
ccgcaccaca 
ctcagcagcg 
gtgaaggggg 
cgcactctgt 
atctacttcc 
aagcggaaca 
aaacagcatc 
gagctccggc 
gacaacaaca 
atggagactc 
gatgctacca 
gcggcactgg 
gaggggcttg 
cctgggggca 
ctaaatgaga 
gacaaaatcc 
agctgtgagc 
ctcaaggaaa 
ggtcaaaatg 
ctccagagct 
agtgctctgt 
gagcgtcagc 
agagagcagc 
cacctccaag 



ctcgccccat 
agacagcctt 
acctgggaca 
tgtttccttt 
ctggtgactg 
tgtctggtga 
ccactacagt 
gaatcctcaa 
tatccgtaca 
ccagctggtt 
tgctgcttca 
ggccggagac 
ccggatcctt 
tgagcagccc 
tccgccccgc 
tgaggctggg 
cttcacgttc 
tcaatctgtc 
ctcaagtcga 
cagagcctcc 
cgctgcgaat 
aattcaatag 
gggtagtttg 
gaactctaaa 
gcccttcccc 
cgtcctgcgg 
ggacgagaca 
tcctagcctg 
cgatcgggca 
cccagcacct 
tggaggagcg 
ttgagctgaa 
tacataaaac 
gccaggttga 
ttcagctgct 
tggtggaggc 
agaacaggga 
ctcagaggga 
tggaacagct 
tggaagtgca 
cccctacaac 
agcaaacaga 
tcagatctgc 
tjgctaaagag 
acacaatggc 
cagagggcat 
tctgcagcca 
tggctgacgg 
agaaggaagc 
gagaactgca 



agtttcatag 
tgagtccacc 
agtattttta 
tccttctctg 
ctgggtcttt 
agtatggata 
ctccgtaatg 
gcaggtagga 
gtgtgcatta 
cgtgggtccc 
cattgactta 
acacgccgtg 
gcgcgtcctc 
caccacctaa 
gcccagacag 
gagcccgcag 
tgcggaagcg 
tcaccttcag 
tcttctacct 
gccgccaacc 
caatgtaatt 
tgatggagta 
acggttaagt 
ccagcagctc 
tcagggccgc 
gaagccgagg 
aaccgcggct 
cccgctccat 
tgctcctccc 
caatgacctg 
catgcaacag 
ggttgaagtg 
atgggccgat 
agaaccgcag 
gcaggaggaa 
agagaagggg 
agatgcaccg 
caggagaatt 
gtctcgagag 
gcccatgcct 
ccagccatct 
ggtcaccaac 
acaggagtcg 
cagggaaagt 
aaagcttccg 
tgcccctgct 
gcttgaaatc 
caagcggtgc 
tgcttggaaa 
cagtaagagc 



ttcatgtctg 
agcttgaata 
tcttgaagca 
gctttcttct 
ggtttacggc 
ggcgaggcat 
acatctggcc 
gactccatat 
ctgctgtgaa 
tcatgcagag 
caccctgagc 
cgggcggcac 
cgacaggaag 
cggcaggggc 
gaagtcccgg 
gccgccctcg 
aagtctgcaa 
gtgatcctag 
ccctcttgat 
actgctgatg 
acggctcaga 
gacatccaga 
agtctaagca 
tctggagcta 
agaccggccg 
gggcctcccc 
atgtcgcctt 
gccacggcaa 
catgggtcgc 
aagaaggaga 
aagtatgaag 
gagagcctga 
gaggaggatc 
caggagacag 
tccaggtttg 
tgtaatctgg 
ggagaccagg 
gaagagctga 
aaacaacaac 
aaagggttac 
gtgtctgatt 
aagattcttc 
tctcagaagc 
gagactgaag 
gaaatgctac 
cagcagcaag 
cagaagctcc 
atgcaatttg 
cataaccagg 
caacagctcc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
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acgttctgga ggcagaaaaa tataatgaaa ttcgaaccca gggacaaaac attcaacacc 3060 
taagtcacag tctgagtcac aaagagcagc taattcagga acttcaggag ctcctacagt 3120 
atcgggatac cacagacaaa actctagaca caaatgaggt gtttcttgag aaactacggc 3180 
aacgaataca agaccgggca gttgctctag agcgggttat agatgaaaag ttctctgctc 3240 
tagaagaaaa ggacaaggaa ctgcggcagc tccggcttgc tgtgagggac cgagaccatg 3300 
acttagagag actgcgttgt gtcctgtctg ccaatgaagc tacca?Icaa agtltggaga 3360 
gtctcctgag ggccagaggc ctggaagtgg agcagttaat tgccacctgc caaaacctcc 3420 
agtggttgaa ggaagaattg gaaaccaagt ttggccactg gcagaaggaa caggagagca 3480 
tcattcagca gttacagaca tctctgcatg acaggaacaa agaagtagag gatctcagtg 3540 
caactttgct ccacaaactt ggacccggcc agagtgaagt agctgaggag ctgtgccagc 3600 
gcctgcagcg gaaggaaagg gtgctgcagg accttctgag tgatcggaac aaacaagcca 3660 
tggagcacga gatggaggtc cagggactgc tccagtcgat gggcacccgg gaacaggaaa 3720 
gacaggctgt tgcagaaaaa atggtacaag ccttcatgga aagaaactcg gaattacagg 3780 
ccctgcggca gtatctaggg gggaaggaat taatggcagc atctcaggca ttcatctcta 384 0 
accaaccagc tggagcgact tctgtaggcc cccaccatgg agagcaaact gaccaaggtt 3900 
ctacgcagat gccctctcga gacgacagca cctcgctgac tgccagagag gaggccagca 3960 
taccccggtc tacattagga gactcagaca cagttgcacg gctggagaaa gaactgagca 4020 
atgccaagga ggagcttgag ctcatggcca aaaaagaaag agaaagccag atagaattgt 4080 

ctgccctgca gtccatgatg gctgtgcaag aggaagagct gcaggtgcag gctgctgact 4140 

tggagtccct gaccaggaac atacagataa aagaagacct cataaaggac ctgcaaatgc 4200 

aactggttga ccctgaagat atgccagcca tggagcgcct gacccaagag gtcttacttc 4260 

ttcgggaaaa agttgcttca gtggaacccc agggtcagga agggtcagag aacaggagac 4320 

aacagttgct gctgatgtta gaaggactag tggatgaacg gagtcggctc aacgaggccc 4380 

tgcaagctga gcggcagctc tacagcagcc tggtcaagtt ccatgcccaa ccagagatct 4440 

ctgagagaga ccgaactctg caggtggaac tggaaggggc ccaggtgtta cgcagtcgac 4500 

tagaagaagt tcttggaaga agcctggagc gcttaagcag gctggagacc ctggccgcca 4560 

ttggaggtgc tactgcaggc gatgagactg aagatacaag cacacagttc acagacagca 4620 

ttgaggagga ggctgcacac aacagccacc agcaactcat caaggtgtct ttggagaaaa 4680 

gcctgaccac catggagacc cagaacacat gtcttcagcc cccttcccca gtaggagagg 4740 

atggtaacag gcatcttcag gaagaaatgc tccacctgag ggctgaaatc caccagccct 4800 

tagaagagaa gagaaaagct gaggcagaac tcaaggagct aaaggctcaa attgaggaag 4860 

caggattctc ctctgtgtcc cacatcagga acaccatgct gagcctttgc ctttgccttg 4920 

agaatgcaga gctgaaagag cagatgggag aagcaatgtc tgatggatgg gaggtggagg 4980 

aagacaagga gaagggcgag gtgatggtgg agaccgtggt ggccaaaggg ggtctgagtg 5040 

aggacagcct tcaggctgag ttcaggaaag tccaggggag actcaagagt gcctacaaca 5100 

tcatcaacct cctcaaagag cagctggtcc tgagaagctc ggaagggaac actaaggaga 5160 

tgccagagtt cctcgtgcgc ctggccaggg aggtggacag aatgaacatg ggcttgcctt 5220 

cctcggagaa gcatcaacac caagaacagg agaatatgac cgcaaggcct ggccccaggc 5280 

cccagagtct caagcttggg acagctctct cagtagacgg ctaccaactg gagaacaagt 5340 

cccaggccca agactctgga catcagccag aatttagcct accagggtcc accaaacacc 5400 

tgcgctccca gctggctcag tgtagacaac ggtaccaaga tctccaggag aagctgctca 5460 

tctcagaagc cactgtgttt gcccaggcaa accagctaga gaagtacaga gccatattaa 5520 

gtgaatccct ggtgaagcag gacagcaagc agatccaggt ggaccttcag gacctgggct 5580 

atgagacttg tggccgaagt gagaatgaag ctgaacgtga ggagaccacc agccctgagt 5640 

gtgaggagca cggtaacctg aagcctgtgg tgctggtgga aggcttgtgc tctgagcaag 5700 

ggtacctgga ccctgtcttg gtcagctcac ctgtgaagaa cccttggaga acaagccagg 5760 

aagccagaag aatccaggca caaggaactt cagacaacag ctctctcctg aggaaggaca 5820 

tccgaaatct gaaagcccag ctaccgaatg cctacaaggt ccttcagaac ctgaggagcc 5880 

gggtccggtc cctgtctgcc acaagcgatt actcatcgag tctggagaga ccccgcaagc 5940 

tgatagccgt ggcaaccctt gagggggcct caccccacag tgtcactgat gaagacgaag 6000 

gcttgttgtc agatggcacc ggggcttttt accctccagg gctccaggcc aaaaagaatc 6060 

tagagaatct catccagaga gtatcccagc tggaggccca gctccccaaa actggactag 6120 

aagggaagct ggctgaagaa ctgaagtccg cctcgtggcc tggaaaatac gattctttga 6180 

ttcaggatca ggcccgaaaa actgtcatat ctgcgtccga aaatacnaaa agggagaagg 6240 

atttgttttc ttctcaccca acattcgaaa gatacgtcaa atcttttgaa gacctcctga 6300 

ggaacaacga cttgactact tacctgggcc agagcttccg ggaacaactt agttcaaggc 6360 

gttcagtgac agacaggctg accagcaaat tcagcacaaa ggatcataag agtgaaaaag 6420 

aagaagttgg gcttgagcca ctggccttca ggttcagcag ggaattacag gagaaagaga 6480 

aagtgattga agtcctgcag gccaaggtgg atacccggtt tttctcaccc cccagcagcc 6540 

atgctgcgtc tgagtcccac cgttgtgcca gcagcacatc tttcctgtcg gatgacatag 6600 

aagcctgctc tgacatggac gtagccagcg agtacacaca ctatgaagag aagaagccct 6660 

cacccagtaa ctcagcagcc agtgcatctc aggggcttaa gggcgagccc agaagcagct 6720 

ccatcagctt gccaactccc cagaaccccc ctaaggaggc cagccaggcc cagccaggct 6780 
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ttcactttaa 
tacccagctt 
cagtggtgtc 
gaagtgttag 
ctagctctcc 
agctgggcag 
ctcagaaagg 
ccaaacccac 
agcgcctgga 
tcagctccac 
ccatgcctca 
cacggctcag 
tgctttcctc 
agcggcggca 
aggagaggct 
aacaacagtg 
aogagtccct 
accaagtccc 
tggagcagag 
acggtgctgg 
cggagctggc 
acgttggctt 
cagactctgc 
cccctcccaa 
acgtcatcgg 
tgctgatcca 
cgcagggcac 
ctctgaacca 
tgccaaactc 
tcctagacct 
gtctgaagac 
ccaggaccca 
ccctgatgtg 
ccaggagtcc 
ctgctgagga 
atgtgtccac 
caaggcccac 
gcactgcttc 
gatagccaac 
gccaacgctc 
cactgttgga 
gccaagtctg 
gtggtaactt 
agttgaacca 
ggagattggt 
taatggcatg 
cgtgggtgtg 
aaggccagca 
attaaagcta 



ctccataccc 
catgcctttc 
cttggctgag 
cattgcccct 
ccgctacagc 
aatcctggag 
gagcatctct 
aggggccgac 
ggagtccata 
ggcccgagaa 
gctctacaat 
tcatgcttcc 
aagttcccag 
gcttctggaa 
gtccctccag 
tgaggagaaa 
ctacgaaaat 
gggtgagttg 
cattcaagtg 
caaagccagt 
aaaccagcag 
gaattctcca 
catcatcagt 
gatggaggtc 
ccatgtggat 
aaagatactg 
agaggcacca 
cagcctagaa 
tcatggttct 
gcgagcccaa 
ggccaaccag 
tgatgtcttg 
cactccagcc 
ttaaaacagc 
atacctggtc 
atgggacagg 
tgggcccatc 
tagacagcct 
tcaggggaac 
ctcagacagt 
gactaccaaa 
gtagtacttt 
gtaggtgccc 
acttccttaa 
ggccacatca 
ggcaaggcct 
tgtataaatg 
taagtgcaga 
actgtgtac 



aagccggcta 
ggcccctctg 
gctcaacaag 
cccacctcca 
aaccctgctc 
cctggatacc 

ggggagctgt 

ctgttggaag 
tgtgtcaatg 
aatggttcca 
gagaacagag 
aggggacact 
ctccaggagc 
gacttgcagg 
gaaaacaact 
cagcagctct 
cctaagaagg 
agctgcctgg 
aacaaccgtc 
ctcagttcct 
ccacccttcc 
cccgtggtcc 
aggacaaaca 
gatgctgctg 
gactacgacg 
tctctcacga 
ggtaccaaaa 
gagtcagctt 
gtactggtag 
gtgtcccaac 
aggaagaaaa 
aagaaagcac 
ttgtgaccct 
aggaagggtg 
cgactcctcc 
caggaaggaa 
actaagcaaa 
cagcagcacg 
taggacctta 
tgtaaaagca 
cccctcactg 
ggtctctacc 
tgcacttagc 
ctctgctgtc 
tgcctattga 
gctctgtact 
tataatatag 
gcgatgggtg 



gcctttccca 
ggcctcccct 
agctgcagat 
catccacgtt 
agccccactc 
tgggcagcgg 
cctcaggctc 
agcatttagg 
acaggctacg 
cctctcactt 
ccctcaggga 
cccaggaagt 
tggagaagga 
agaagcagga 
ccaggctgca 
ccctgtccct 
gcttgaaagc 
tggcagagat 
tgcggctgca 
gccctgttaa 
aaggttcagc 
tccccagcaa 
atggttcgga 
atggcccatt 
ccctacagca 
ggccagcacg 
gtgtccatga 
ccctcctcac 
gcgaagaggg 
agcaacagct 
gcatggagca 
ggactaattt 
tgccttccag 
ggcctgcccg 
cctgctggag 
agtggcatcc 
actcatgtag 
gtgcccacct 
ccacccacaa 
cacatcattg 
acccagtctt 
accacaccag 
ctagcacctt 
ccctgaatcc 
atgtttagtg 
gaagactgtg 
atttatatat 
agaagacgct 



ggcaccaatg 
tcttggttgc 
gctgcagaag 
gcttagcaac 
cccagcaagg 
ccagtgggac 
ctcgatgtac 
tgagatccgg 
ggagcagctg 
ctacagtcag 
agaaaaccaa 
ggaccacctg 
gctggagcag 
tgagatcgtg 
gcacaagctg 
gcagtcagag 
cttcagccta 
tcgagctctg 
gctggaacag 
ccagagcttc 
tgcttcccct 
ttcgtgctct 
tgagtctgca 
tgccagtgga 
gcagattggg 
cagcgtccct 
gcttcggagc 
catgttctgg 
aaacctgatg 
ccttcagagc 
gttcatcgtg 
agagatgaaa 
gagccacata 
cccctagtac 
ctccagggaa 
tgacaatgaa 
actgtgtaga 
cgttacagtt 
acaggatgtg 
agtggcagcg 
gggccaggcc 
agagagtcta 
ctgtttctta 
taacttccct 
aacagcatat 
tcttcacagt 
gtcgctatgg 
aagcagtctt 



cacttcactg 
tgtgagacac 
cagctgggac 
cacacagaag 
ggcaccatag 
atgatgaggc 
cagcttaact 
aacctgcgcc 
cagcataggc 
ggcctggagt 
agcctgcaga 
agggaggctc 
cagaaggctg 
catttccgag 
gccctcctgc 
ctccagatct 
gattcctgtt 
agagtgcagt 
cagatggatc 
tcagccaagg 
ccagtccggg 
gttcctggct 
gcaacgaaga 
cacggcagac 
gaagggaagc 
gcactggacg 
agcgccaggg 
agagcagctt 
gagaaagaac 
actgctgtgc 
agccatctga 
tccttcaggg 
aaaggcgaag 
agctgcctgt 
gggctcatat 
tatgattagc 
aggccccccg 
ctcacctcaa 
tggtcccaat 
tccagccgga 
agctctgtgg 
tatagcaaat 
cgtgatctca 
caggggaatt 
cggtgcctct 
gctcatagga 
ctatgtgttg 
tcttatggct 
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atgtctaatg 
agcctcaagc 
cgggaggacg 
gagctccagg 
agccagaatg 
gtttatgagc 
gatgaagcca 
tcagagaggt 
cttgaccaat 
agcttggctg 
catctgctgg 
caacaaaagc 
ctggcagaac 
aaactgaatg 
acgacaatcc 
taccaggtga 
agccagctcg 
ctgcttgacc 
ctgttacgcc 
gctgcagcac 
cgaaaagctt 
ctggaggcag 
cacagtctga 
gataccacag 
atacaagacc 
gaaaaggaca 
gagagactgc 
ctgagggcca 
ttgaaggaag 
cagcagttac 
ttgctccaca 
cagcggaagg 
cacgagatgg 
gctgttgcag 
cggcagtatc 
ccagctggag 
cagatgccct 
cggtctacat 
aaggaggagc 
ctgcagtcca 
tccctgacca 
gttgaccctg 
gaaaaagttg 
ttgctgctga 
gctgagcggc 
agagaccgaa 
gaagttcttg 
ggtgctactg 
gaggaggctg 
accaccatgg 
aacaggcatc 
gagaagagaa 
ttctcctctg 
gcagagctga 
aaggagaagg 
agccttcagg 
aacctcctca 
gagttcctcg 
gagaagcatc 
agtctcaagc 
gcccaagact 
tcccagctgg 
gaagccactg 



gatatcgcac 
tgcgcatcta 
tctacaagcg 
acaggaaaca 
aagcagagct 
tcctagacaa 
cacagatgga 
ggaaggatgc 
attctgcggc 
cccaggaggg 
aggagcctgg 
cagacctaaa 
tccaggacaa 
acatgagctg 
aaagcctcaa 
ttgaaggtca 
gacagctcca 
ttcagagtgc 
agaaagagcg 
aggagagaga 
tgcaacacct 
aaaaatataa 
gtcacaaaga 
acaaaactct 
gggcagttgc 
aggaactgcg 
gttgtgtcct 
gaggcctgga 
aattggaaac 
agacatctct 
aacttggacc 
aaagggtgct 
aggtccaggg 
aaaaaatggt 
taggggggaa 
cgacttctgt 
ctcgagacga 
taggagactc 
ttgagctcat 
tgatggctgt 
ggaacataca 
aagatatgcc 
cttcagtgga 
tgttagaagg 
agctctacag 
ctctgcaggt 
gaagaagcct 
caggcgatga 
cacacaacag 
agacccagaa 
ttcaggaaga 
aagctgaggc 
tgtcccacat 
aagagcagat 
gcgaggtgat 
ctgagttcag 
aagagcagct 
tgcgcctggc 
aacaccaaga 
ttgggacagc 
ctggacatca 
ctcagtgtag 
tgtttgccca 



tctgtcccag 
cttcctggag 
gaacattgag 
gcatctacat 
ccggcgccag 
caacattcag 
gactctggtg 
taccaagaac 
actggctcag 
gcttgtggaa 
gggcatggaa 
tgagacccct 
aatccagcaa 
tgagctcaga 
ggaaatgcta 
aaatgacaca 
gag.^tcagag 
tctgttctgc 
tcagctggct 
gcagcagaag 
ccaaggagaa 
tgaaattcga 
gcagctaatt 
agacacaaat 
tctagagcgg 
gcagctccgg 
gtctgccaat 
agtggagcag 
caagtttggc 
gcatgacagg 
cggccagagt 
gcaggacctt 
actgctccag 
acaagccttc 
ggaattaatg 
aggcccccac 
cagcacctcg 
agacacagtt 
ggccaaaaaa 
gcaagaggaa 
gataaaagaa 
agccatggag 
accccagggt 
actagtggat 
cagcctggtc 
ggaactggaa 
ggagcgctta 
gactgaagat 
ccaccagcaa 
cacatgtctt 
aatgctccac 
agaactcaag 
caggaacacc 
gggagaagca 
ggtggagacc 
gaaagtccag 
ggtcctgaga 
cagggaggtg 
acaggagaat 
tctctcagta 
gccagaattt 
acaacggtac 
ggcaaaccag 



cacctcaatg 
gagcgcatgc 
ctgaaggttg 
aaaacatggg 
gttgaagaac 
ctgctgcagg 
gaggcagaga. 
agggaagatg 
agggacagga 
cagctgtctc 
gtgcagccca 
acaacccagc 
acagaggtca 
tctgcacagg 
aagagcaggg 
atggcaaagc 
ggcattgccc 
agccagcttg 
gacggcaagc 
gaagctgctt 
ctgcacagta 
acccagggac 
caggaacttc 
gaggtgtttc 
gttatagatg 
cttgctgtga 
gaagctacca 
ttaattgcca 
cactggcaga 
aacaaagaag 
gaagtagctg 
ctgagtgatc 
tcgatgggca 
atggaaagaa 
gcagcatctc 
catggagagc 
ctgactgcca 
gcagggctgg 
gaaagagaaa 
gagctgcagg 
gacctcataa 
cgcctgaccc 
caggaagggt 
gaacggagtc 
aagttccatg 
ggggcccagg 
agcaggctgg 
acaagcacac 
ctcatcaagg 
cagccccctt 
Ctgagggctg 
gagctaaagg 
atgctgagcc 
atgtctgatg 
gtggtggcca 
gggagactca 
agctcggaag 
gacagaatga 
atgaccgcaa 
gacggctacc 
agcctaccag 
caagatctcc 
ctagagaagt 



acctgaagaa 
aacagaagta 
aagtggagag 
ccgatgagga 
cgcagcagga 
aggaatccag 

. aggggtgtaa 

caccgggaga 
gaattgaaga 
gagagaaaca 
tgcctaaagg 
catctgtgtc 
ccaacaagat 
agtcgtctca 
aaagtgagac 
ttccggaaat 
ctgctcagca 
aaatccagaa 
ggtgcatgca 
ggaaacataa 
agagccaaca 
aaaacattca 
aggagctcct 
ttgagaaact 
aaaagttctc 
gggaccgaga 
tgcaaagtat 
cctgccaaaa 
aggaacagga 
tagaggatct 
aggagctgtg 
ggaacaaaca 
cccgggaaca 
actcggaatt 
aggcattcat 
aaactgacca 
gagaggaggc 
agaaagaact 
gccagataga 
tgcaggctgc 
aggacctgca 
aagaggtctt 
cagagaacag 
ggctcaacga 
cccaaccaga 
tgttacgcag 
agaccctggc 
agttcacaga 
tgtctttgga 
ccccagtagg 
aaatccacca 
ctcaaattga 
tttgcctttg 
gatgggaggt 
aagggggtct 
agagtgccta 
ggaacactaa 
acatgggctt 
ggcctggccc 
aactggagaa 
ggtccaccaa 
aggagaagct 
acagagccat 



ggagaacttc 
tgaagtcagc 
cctgaaacga 
ggatctcaac 
gacagaacac 
gtttgcaaag 
tctggagctc 
ccaggtgaag 
gctgaggcag 
acaactgtta 
gttacccacg 
tgattcccac 
tcttcaagag 
gaagcaagat 
tgaagagctg 
gctacaccag 
gcaagtggcc 
gctccagagg 
atttgtggag 
ccaggaatta 
gctccacgtt 
acacctaagt 
acagtatcgg 
acggcaacga 
tgctctagaa 
ccatgactta 
ggagagtctc 
cctccagtgg 
gagcatcatt 
cagtgcaact 
ccagcgcctg 
agccatggag 
ggaaagacag 
acaggccctg 
ctctaaccaa 
aggttctacg 
cagcataccc 
gagcaatgcc 
attgtctgcc 
tgacttggag 
aatgcaactg 
acttcttcgg 
gagacaacag 
ggccctgcaa 
gatctctgag 
tcgactagaa 
cgccattgga 
cagcattgag 
gaaaagcctg 
agaggatggt 
gcccttagaa 
ggaagcagga 
ccttgagaat 
ggaggaagac 
gagtgaggac 
caacatcatc 
ggagatgcca 
gccttcctcg 
caggccccag 
caagtcccag 
acacctgcgc 
gctcatctca 
attaagtgaa 
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1080 
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1620 
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2040 
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2700 
2760 
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2940 
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tccctggtga 
acttgtggcc 
gagcacggta 
ctggaccctg 
agaagaatcc 
aatctgaaag 
cggtccctgt 
gccgtggcaa 
ttgtcagatg 
aatctcatcc 
aagctggctg 
gatcaggccc 
ttttcttctc 
aacgacttga 
gtgacagaca 
gttgggcttg 
attgaagtcc 
gcgtctgagt 
tgctctgaca 
agtaactcag 
agcttgccaa 
tttaactcca 
agcttcatgc 
gtgtccttgg 
gttagcattg 
tctccccgct 
ggcagaatcc 
aaagggagca 
cccacagggg 
ctggaggagt 
tccacggccc 
cctcagctct 
ctcagtcatg 
tcctcaagtt 
cggcagcttc 
aggctgtccc 
cagtgtgagg 
tccctctacg 
gtcccgggtg 
cagagcattc 
gctggcaaag 
ctggcaaacc 
ggcttgaatt 
tctgccatca 
cccaagatgg 
atcggccatg 
atccaaaaga 
ggcacagagg 
aaccacagcc 
aactctcatg 
gacctgcgag 
aagacggcca 
acccatgatg 
atgtgcactc 



agcaggacag 
gaagtgagaa 
acctgaagcc 
tcttggtcag 
aggcacaagg 
cccagctacc 
ctgccacaag 
cccttgaggg 
gcaccggggc 
agagagtatc 
aagaactgaa 
gaaaaactgt 
acccaacatt 
ctacttacct 
ggctgaccag 
agccactggc 
tgcaggccaa 
cccaccgttg 
tggacgtagc 
cagccagtgc 
ctccccagaa 
tacccaagcc 
ctttcggccc 
ctgaggctca 
cccctcccac 
acagcaaccc 
tggagcctgg 
tctctgggga 
ccgacctgtt 
ccatatgtgt 
gagaaaatgg 
acaatgagaa 
cttccagggg 
cccagctcca 
tggaagactt 
tccaggaaaa 
agaaacagca 
aaaatcctaa 
agttgagctg 
aagtgaacaa 
ccagtctcag 
agcagccacc 
ctccacccgt 
tcagtaggac 
aggtcgatgc 
tggatgacta 
tactgtctct 
caccaggtac 
tagaagagtc 
gttctgtact 
cccaagtgtc 
accagaggaa 
tcttgaagaa 
cagccttgtg 



caagcagatc 
tgaagctgaa 
tgtggtgctg 
ctcacctgtg 
aacttcagac 
gaatgcctac 
cgattactca 
ggcctcaccc 
tttttaccct 
ccagctggag 
gtccgcctcg 
catatctgcg 
cgaaagatac 
gggccagagc 
caaattcagc 
cttcaggttc 
ggtggatacc 
tgccagcagc 
cagcgagtac 
atctcagggg 
cccccctaag 
ggctagcctt 
ctctgggcct 
acaagagctg 
ctccacatcc 
tgctcagccc 
atacctgggc 
gctgtcctca 
ggaagagcat 
caatgacagg 
ttccacctct 
cagagccctc 
acactcccag 
ggagctggag 
gcaggagaag 
caactccagg 
gctctccctg 
gaagggcttg 
cctggtggca 
ccgtctgcgg 
ttcctgccct 
cttccaaggt 
ggtcctcccc 
aaacaatggt 
tgctgatggc 
cgacgcccta 
cacgaggcca 
caaaagtgtc 
agcttccctc 
ggtaggcgaa 
ccaacagcaa 
gaaaagcatg 
agcacggact 



caggtggacc 
cgtgaggaga 
gtggaaggct 
aagaaccctt 
aacagctctc 
aaggtccttc 
tcgagtctgg 
cacagtgtca 
ccagggctcc 
gcccagctcc 
tggcctggaa 
tccgaaaata 
gtcaaatctt 
ttccgggaac 
acaaaggatc 
agcagggaat 
cggtttttct 
acatctttcc 
acacactatg 
cttaagggcg 
gaggccagcc 
tcccaggcac 
ccccttcttg 
cagatgctgc 
acgttgctta 
cactccccag 
agcggccagt 
ggctcctcga 
ttaggtgaga 
ctacgggagc 
cacttctaca 
agggaagaaa 
gaagtggacc 
aaggagctgg 
caggatgaga 
ctgcagcaca 
tccctgcagt 
aaagccttca 
gagattcgag 
ctgcagctgg 
gttaaccaga 
tcagctgctt 
agcaattcgt 
tcggatgagt 
ccatttgcca 
cagcagcaga 
gcacgcagcg 
catgagcttc 
ctcaccatgt 
gagggaaacc 
cagctccttc 
gagcagttca 
aatttagaga 



ttcaggacct 
ccaccagccc 
tgtgctctga 
ggagaacaag 
tcctgaggaa 
agaacctgag 
agagaccccg 
ctgatgaaga 
aggccaaaaa 
ccaaaactgg 
aatacgattc 
cnaaaaggga 
ttgaagacct 
aacttagttc 
ataagagtga 
tacaggagaa 
caccccc-:ag 
tgtcggatga 
aagagaagaa 
agcccagaag 
aggcccagcc 
caatgcactt 
gttgctgtga 
agaagcagct 
gcaaccacac 
caaggggcac 
gggacatgat 
tgtaccagct 
tccggaacct 
agctgcagca 
gtcagggcct 
accaaagcct 
acctgaggga 
agcagcagaa 
tcgtgcattt 
agctggccct 
cagagctcca 
gcctagattc 
ctctgagagt 
aacagcagat 
gcttctcagc 
cccctccagt 
gctctgttcc 
ctgcagcaac 
gtggacacgg 
ttggggaagg 
tccctgcact 
ggagcagcgc 
tctggagagc 
tgatggagaa 
agagcactgc 
tcgtgagcca 
tgaaatcctt 



gggctatgag 
tgagtgtgag 
gcaagggtac 
ccaggaagcc 
ggacatccga 
gagccgggtc 
caagctgata 
cgaaggcttg 
gaatctagag 
actagaaggg 
tttgattcag 
gaaggatttg 
cctgaggaac 
aaggcgttca 
aaaagaagaa 
agagaaagtg 
cagccatgct 
catagaagcc 
gccctcaccc 
cagctccatc 
aggctttcac 
cactgtaccc 
gacaccagtg 
gggacgaagt 
agaagctagc 
catagagctg 
gaggcctcag 
taactccaaa 
gcgccagcgc 
taggctcagc 
ggagtccatg 
gcagacacgg 
ggctctgctt 
ggctgagcgg 
ccgagaggag 
cctgcaacaa 
gatctacgag 
ctgttaccaa 
gcagttggag 
ggatcacggt 
caaggcggag 
ccgggacgtt 
tggctcagac 
gaagacccct 
cagacacgtc 
gaagctgctg 
ggacgcgcag 
cagggctctg 
agctttgcca 
agaactccta 
tgtgcgtctg 
tctgaccagg 
cagggccctg 
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ggatccttga 
ctgtggcccg 
ggatttgggg 
cggactacag 
cttgaaatct 
cgggagccca 
agctggggag 
cttaggcgtt 
gagtcggcgc 
atgctgaccg 
acccagtagc 
aaggagattt 
cacacggcgt 
ccccgcgatg 
tatcgattcg 
ctgctactgg 
gatgactcL? 
gatgcgagat 
gtggagaatg 
aaccgaccca 
gccatttgta 
tggtcgacca 
gcaagcacaa 
gtggaatctt 
gagagaagtg 
catgggtgta 
cccatccaga 
gccaagcctg 
aaaccccttt 
tgggatgatc 
ctgaaacaac 
tcccacttgg 
caagagaaac 
caagatggta 
gagttgtacc 
caccaaagcc 
gtagctctgc 
cagagggtgg 
gtagaggctg 
gaattgcgaa 
cgtgcctggg 
ctaaaccata 
tatcgagata 
cagcgaatac 
ctagaagaga 
gacttagaga 
agtctcctga 
cagtggctga 
atcattcagc 
gcaacactgc 
cgtctacagc 
ctggaacatg 
agccaagctg 
gccctgcgcc 
caacaagctg 
cagatacctt 
agatccacat 
aaagaggaac 
ctacagtcca 
tctctgacca 
gttgatcctg 
gaaaaagttg 
ttgctgctga 



gggcactggt 
cgccagtagt 
cgagggtcgc 
ctcccagaag 
gatcctccat 
gtctgctaaa 
agacatggca 
tccacgcccg 
gcggggcagg 
gagaccaccg 
cgccgccgcc 
gcaggatctg 
ccaagctcaa 
gcaaagccga 
acacagttat 
agaaggatcg 
gcgcggagat 
actctgcact 
aggatcagat 
ggagatgccg 
aggtacctcg 
gcatttgcac 
aggtaccccc 
tggatgcaag 
caaaggaact 
atcacaagct 
gcccccgagg 
gccctagcat 
acaagacaec 
tctgtgaaga 
aaaagctgaa 
cagaactcca 
ttaatgaaat 
caattcagaa 
aggtaattga 
agcttggaca 
ttgatcttca 
tacgacagaa 
cagcacacga 
aagccttgca 
aggctgaaaa 
gtctgagtca 
actcagacaa 
atgataaagc 
aagaaaaaga 
gactgcgcga 
gggccaaagg 
aagaagaaat 
agttacagac 
tctgcaaact 
gaaaggaaag 
aaatggagat 
ctgcagagaa 
aatatttagg 
aagttacccc 
ccagagatga 
taggagactt 
ttgaactcat 
tgatggctgt 
ggaacataca 
aagacatacc 
cttcagtaga 
tgctagaagg 



gcgactttca 
gctttctgct 
gctggctacc 
agccttgtgg 
ccccgaggct 
aggggaggac 
gggtcggagc 
ccccctgccc 
ggcgcgcccc 
aggcgggaga 
gccgcctcgg 
tgcccgagag 
tctccaggtt 
gttcgcttgc 
tgcccggatt 
cctcaagttc 
caaggcgggg 
gctccaggag 
ccaggagcca 
tggttgtgcc 
aaaggtggcc 
tgaagaacca 
agatggagaa 
cgtccaggct 
tggaaagtgt 
ggaattagct 
gagcaggctt 
gacagatgga 
tgtgagttat 
ttatttgccg 
ttcacatgag 
ggaaaaaatc 
gagctatgaa 
cctcaaggaa 
aggtcaaaat 
acttcacagc 
gagtgcttta 
agagcgccaa 
gagtgaacag 
gcagctacaa 
atacaatgag 
caaggagcag 
aacccttgaa 
tgttgctctg 
actgcgccag 
tgtcctctcc 
cctggaagtg 
ggaaaccaaa 
gtctcttcat 
tggaccaggg 
gatgctgcag 
tcaaggcctg 
gttggtgcaa 
agggagagac 
cactggccgt 
tagcacttca 
ggacacagtt 
ggctaaaaaa 
gcaggaagaa 
gattaaagaa 
agctatggaa 
atcccagggt 
actagtagat 



ggtgaggtct 
ccgcactcgc 
tcgcatgcgc 
aggccgcaga 
ttgcgtctgc 
gttgaggacg 
gcggcctgcg 
gaggggcggg 
tgcagagtgg 
cagagcgcgg 
gaagcttgcc 
ctgtgtggaa 
ctgctttcgc 
agcaagtgtg 
gaagcgcttt 
tgcattgcca 
aatgggacgg 
gacttcgcct 
cacagctgcc 
gctttgcggg 
agaagtatct 
gcgttgtctg 
agcatggagg 
agccctccac 
gactgttgtt 
cttagcatga 
ccgattccag 
gttagttccg 
cccttggagc 
ctccgggtcc 
accactataa 
cagcaaacag 
ctaaagtgtg 
actctgaaaa 
gacacaatgg 
tcagagggta 
ttctgcagcc 
ctggctgatg 
cagaaagagg 
gaagaattgc 
attcgaaccc 
ttgcttcagg 
gcaaatgaaa 
gagcgggcta 
cttcgtcttg 
tccaatgaag 
gaacagttat 
tttagccgtt 
gataggaaca 
cagagtgaga 
gaccttctaa 
cttcagtctg 
gccttaatgg 
tccctgatgt 
cttggaaaac 
ttgactgcca 
gcagggctgg 
gaaagagaaa 
gagctgcagg 
gatctcataa 
cgcctgaccc 
caagaaattt 
gaacggagtc 



tagcagatga 
cgtgagccag 
agagccggaa 
cgcgaagccg 
gcggccggcc 
cggcggctgg 
cctctgtcac 
gctgacggct 
ggaccccact 
cgaagagcca 
acccgctagg 
accagcggcg 
acgtcttggg 
ctttcatgct 
ctattgagcg 
gtatgtatcg 
ttgacatgtc 
attcagggtt 
atggttcaga 
ttgctgattc 
cctgcggccc 
aggttgggcc 
aagagacgcc 
aacagaaaga 
cagatgatca 
ttaaaggtct 
tgaaatccag 
gtttccttaa 
tttcagacct 
agcccatgac 
ctcagcagtc 
aggccaccaa 
ctcaggagtc 
gcagggaacg 
caaagcttcg 
cttctccagc 
aacttgaaat 
ccaaacaatg 
cttcttggaa 
agaataagag 
aggaacaaaa 
aatttcggga 
tgttgcttga 
tagatgaaaa 
ctgtgagaga 
ctactatgca 
ctactacctg 
ggcagaagga 
aagaagtgga 
tagcagagga 
gtgatcgaaa 
tgagcaccag 
aaagaaattc 
cccaagcacc 
agactgatca 
aagaggatgt 
aaaaagaact 
gtcagatgga 
tgcaggctgc 
aggacctgca 
aggaagtctt 
caggaaaccg 
ggctcaatga 



aagcggctgg 
gtgtgcaacc 
gcccgctgac 
ctggcgccat 
gctgctgctc 
cgggagagac 
tcagcatcct 
ctggtacccg 
gggctgtgcc 
ttgagtggtc 
agggaagatg 
ctggatcttc 
caaggatgtc 
tgatcgaatc 
cttgcaaaag 
gaagaataac 
cgtcttaccc 
tgagtgctgg 
aggccctgga 
tgactatgaa 
ttctagcagg 
acccgactta 
tggttcctct 
tgaggagact 
ggctccgcag 
tgattataag 
cctacctgga 
caggtctttg 
gcaggagctg 
tgaagagttg 
tgtatctgat 
caagattctt 
gtctcaaaag 
tgagactgag 
agaaatgctg 
tcagcaacag 
acagaagctc 
tgtgcaattt 
acataaccag 
ccaacagctt 
catccagcac 
gctcctacag 
gaaacttcgc 
attctctgct 
gcgagatcat 
aagtatggag 
tcaaaacctc 
acaagagagt 
ggatcttagt 
gctgtgccag 
taaacaagtg 
ggagcaggaa 
agaattacag 
catctctaac 
aggttcaatg 
cagcataccc 
gagtaatgcc 
actttctgct 
tgatatggag 
aatgcaactg 
acttcttcgg 
aagacaacag 
ggccttacaa 
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gcagagagac 
agagaccgaa 
gaagttcttg 
ggtgcagctg 
gaggaggctg 
gcaactgtgg 
aacaggtgtc 
gagaagagga 
ttctcctcag 
ctgaaagagc 
aagggcgagg 
caggctgagt 
ctcaaagaac 
cttgtgcatc 
aagcaccaac 
cttgaccttg 
cgtgaccctg 
cagctgtcac 
gccactgtct 
tccttggtga 
acttgtggcc 
gagcacaaca 
cggggctcaa 
caggaagagt 
aaagatctga 
gtccggtccc 
agagctgttg 
tggctgtctg 
gagagtctca 
gagaagctgg 
caggatcagg 
tgttatctta 
agcaatgaca 
cagctgacag 
caagctggac 
gtgattgaag 
gccttgtctg 
gcctgctctg 
cccagtcact 
tcatcaacca 
ccaactcccc 
tccataccca 
ctgcctttca 
ttggctgagg 
actgttcctc 
tactacctca 
atcctagagc 
gggagtgtat 
acaggggctg 
gaggagtcca 
actgctcgtg 
cagctctgca 
agtcatgttt 
tctcgatccc 
cagcttttgg 
ctttccctcc 
aagctggttc 
tcagagctac 
agcctggatg 
gtacgagctc 
cagctgcaac 
aaccagaact 
gctgtgtccc 



agctctatag 
ctctgcaggt 
gaagaagctt 
caggggatga 
cacaccatag 
agacccagaa 
ttcaggaaga 
aagctgagga 
tgtcccacat 
agatgggaga 
tgatggttga 
tcagaaagct 
aacttgtgct 
tgaccagcac 
accaagagga 
gggctacctt 
ggcctcagtc 
aatgcaaaca 
ttgctcaggc 
agcaggacag 
gaagcgagaa 
gcctcaagga 
cactggctag 
tccgggtata 
aggcccagct 
tctcagttac 
gcaccttgga 
atggcactgg 
tccagagagt 
ctgaggagct 
cccgggaact 
tcacccggca 
ttgactacta 
agaggctcac 
ttgagccact 
tcctgcaggc 
actcccaccg 
acatggacat 
cagattccat 
gtgcatctca 
agaatacccc 
agctggctag 
gccccactgg 
ctcagcagga 
ctgcttccac 
actctgccca 
ctgggtacct 
ctggggacct 
acctgctgga 
tctgcatcaa 
gaaggggatc 
atgagaacag 
ccagagagca 
accttcaaga 
aagacttgag 
aggaaaacga 
tcctgcagca 
aaatctacga 
cctgt caeca 
tgagagggca 
agcagctgga 
tcccagccag 
ctccagtccg 



cagtctggtg 
ggaactggaa 
ggagegctta 
caccgaagat 
tcaccagcaa 
cccatctttt 
aatgctccac 
ggaactgaag 
caggaacacc 
agcaatgtct 
gactgtggta 
ccagggaaaa 
gagtagcaag 
cattgaaaga 
ggggaatgtg 
cacagtggat 
agegtttage 
aegctatcaa 
taacgagctg 
caagcagatc 
tgaggctgaa 
aatggtcctg 
ttcctctgag 
tggaaagtca 
geagaatgee 
aagtgattat 
ggggtcttca 
ggctttctac 
atcccagctg 
gagatcagee 
gtcttaccta 
tgcaaaagat 
cctgggacag 
cagcaaactc 
ggccctcagg 
caagctggat 
ctctcccagc 
agtcagegag 
ccatcattcg 
gggggctaag 
caaggaggee 
ccttcctcag 
ccctctcctc 
gctacagatg 
agctacattg 
gcctcactct 
gggcagcagt 
atcctcaggc 
agagcatctt 
tgaccgccta 
cacttctaac 
agtcctcagg 
ctcccaggaa 
gctggaaaag 
ggagaagcag 
ctccagtggg 
acagtgtgaa 
ggcactttat 
aatccctttg 
gctggagcag 
gageggtget 
cactgaccct 
ggatgttggt 



aagttccatg 
ggggctcagg 
aacaggctgg 
acaagcactg 
cttgtcaagg 
tcccctcctt 
ctgagggctg 
gagctaaagg 
atgetgagee 
gatggatggg 
accaaagagg 
ctgaagaatg 
gaagggaata 
ataaacacag 
actgtgaggc 
gcccaccaat 
cvaccagggt 
gatctccagg 
gagaaataca 
caggtggacc 
egggaggaaa 
atggaggggc 
aggaagcect 
gaaaacatct 
aacaaggtca 
tegtctagtc 
cctcatagtg 
tctccagggc 
gaggcccagc 
tcgtggcctg 
eggcaaaaaa 
acagtaaaat 
agettceggg 
agcaccaagg 
ctcagcaggg 
gctcggtccc 
agcacctctt 
tacacacact 
agtcattctg 
gccgaatcca 
aaccaggccc 
gcaccattgc 
ettggctget 
ctgeagaage 
ctgagcaacg 
cctccaaggg 
ggcaagtggg 
tcctctgtgt 
ggtgaaatcc 
egggagcaac 
ttctacagtc 
gaagacaatc 
acagaaagee 
gagctggagc 
caagaggtct 
ccttgcctct 
gagaaacagc 
ggcaattcca 
agcagtgacc 
agcattcagg 
ggcaaageca 
ggaaacaagc 
atgaattccc 



cccatccaga 
tgttacgcag 
agaccctggc 
agttcactga 
tggctttgga 
ctccgatggg 
agttccacca 
ctcaaattga 
tttgccttga 
agatcgagga 
gtctgagtga 
cccacaatat 
gtaaacttac 
aactggttgg 
ctttccccag 
tggataacca 
ccacccagca 
agaagctget 
gagttatget 
tccaggacct 
ccaccagtcc 
tgtgctctga 
tggagaacca 
tggtcctacg 
ttcaaaacct 
tggaaagacc 
tccctgatga 
ttcaggccaa 
tcccaaaaaa 
ggaaatatga 
tacgagaagg 
cttttgagga 
agcaactcgc 
atcataaaag 
agetgeagga 
tcacaccctc 
tcctgtctga 
atgaagagaa 
ctgtgttgtc 
acagcaaccc 
attcaggctt 
cctcagctcc 
gtgagacacc 
agttgggaga 
acttggaagc 
gcaccataga 
atgtgatgag 
accagcttaa 
ggaacctgcg 
tggaacaccg 
agggcctgga 
gaagacttca 
tgagggaggc 
accagaaggt 
tgcatttcag 
ccctggtcag 
agctctttga 
agaaggggct 
tgagccacct 
ggaacaattg 
gcctcagccc 
agctgctcct 
cagctctggt 



gagctctgag 
teggctagaa 
cgccattgga 
cagtattgag 
gaaaagtctg 
aggggacagt 
gcacttagaa 
ggaagcagga 
gaatgeggag 
agacaaggag 
gagtagcett 
catcaacctc 
tccagagctc 
ttcccctggg 
accccagagc 
gtcccagcct 
cctgcgctcc 
gctatcagaa 
tacaggtgaa 
gggctatgag 
tgagtgtgag 
geagggaege 
gctagggaag 
aaaggacatc 
caagagcegg 
ccggaagctg 
ggatgagggg 
aaaggacctg 
tggactagaa 
ttccctgatt 
gagaggtatt 
tctcctaagg 
ccagggaagc 
tgagaaagat 
gaaggagaaa 
cagcagccat 
tgaactggaa 
gaaagcttct 
ttctaaacca 
catcagcttg 
tcattttcac 
atccagcttc 
agtggtctcc 
aagtgccagc 
cgactcttcc 
actgggaaga 
gectcagaaa 
ctccaaaccc 
ccagcgcctg 
gctgacctct 
gtccatacct 
ggctcaactg 
tctgctgtcc 
ggaaaggcag 
ggaggaacgt 
actgcagcac 
gtccctccag 
gaaagcttac 
ggtggcagag 
tetgegactg 
ctcctccatt 
ccaagattca 
cttccccagc 



3B40 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 
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tctgcttcct ctactcctgg ctcagaaacg cccataatca acagagcaaa tggcttgggt 7620 

ttggatactt ctccagtaat gaagacccct cccaagctag agggtgatgc tactgatggc 7680 

tcctttgcca ataagcatgg ccgccatgtc attggccaca ttgatgacta cagtgcccta 7740 

agacagcaga ttgcggaggg caagctgctg gtcaaaaaga tagtgtctct tgtgagatca 7800 

gcgtgcagct tccctggcct tgaagcccaa ggcacagagg tgctaggcag caaaggtatt 7860 

catgagcttc ggagcagcac cagtgccctg caccatgccc tagaggagtc ggcttccctc 7920 

ctcaccatgt tctggagagc agccctgcca agcacccaca tccctgtgct gcctggcaaa 7980 

gtgggagaat caacagaaag ggaacttctg gaactgagaa ccaaagtatc caaacaggag 8040 

cggctccttc agagcacaac tgagcatctg aagaacgcca accagcagaa ggagagcatg 8100 

gagcagttca tcgtcagcca gctaaccaga acacatgatg ttttaaagaa ggcaaggact 8160 

aacttagagg tgaaatccct aagggctctg ccatgtactc cagccttgtg acccttgcct 8220 

tccaggaacc atgcaagaag cgcagccacc agaagtcctt aaaacagcag gaaaggtggg 8280 

cctgtccccc ttttgtgcag ctacctatct gctgaggagc atctgggcct cattcctcca 8340 

agtccacggg agggtccaga agagggagtc agagatgtat cctggtggag ctgggagaaa 8400 

ggcagaaagc ctttctgaca gctatggaat acgattagcc aaggtccact tggcccagca 8460 

ctaagaaaaa gatgcgtagt ttgcacagaa ggttttgtga tcctgcctct caacagcccc 8520 

agcagcttgg gaactagcaa gagcacattt cttgcctcat cagctgtcct gagatggnaa 8580 

actcagtgga tataggaccc tgattccgat gaaaggggca cgtggtccca atgctggagc 8640 

tcctctggca ggttctaaaa gcacactact gagcagcggt gccctgccgg acactgctgg 8700 

cgggggctca gtgagcacta ctcacagatc cacacctgac cctgttgggt cgagtcaggc 8760 

tgggccttgg tctgcactgt agcacctgtg ttctttgagt tcacatcatg aatgtggtga 8820 
cttcccagat accatctcag gcttaaccta gcacatccta tttcttttct tctatgatat 
ccaaattgga ctgacctcac ttcaaagttg ctgtcccatt ttgtcaccct atcttatctc 
ggggaaattg cagactgatg gccagaccaa ctctgttgaa attcttgcat agagcaaacc 

tgtgctcatt tttaagtggc atgggagagg cccccagcct agtaaagcct agtctgtgtc yuou 

ttcacagtgc tggtagaatg tgtttgtgtg tataaatata tgatatagat ttatatatgt 9120 

tgctaacgcc atatattgaa ggccaacata actggtggac agggtgggtg acagaaaatg 9180 

aaagcctttt tggtgattgt taaagcaaga tgtgtataaa gaaataaata gtttttcttt 9240 



8880 
8940 
9000 
9060 



9241 



<210> 5 
<211> 2517 
<212> PRT 
<213> human 

<400> 5 

Met Lys Glu He Cys Arg He Cys Ala Arg Glu Leu Cys Gly Asn Gin 

1 5 io 15 

Arg Arg Trp lie Phe His Thr Ala Ser Lys Leu Asn Leu Gin Val Leu 

20 25 30 

Leu Ser His Val Leu Gly Lys Asp Val Pro Arg Asp Gly Lys Ala Glu 

35 40 45 

Phe Ala Cys Ser Lys Cys Ala Phe Met Leu Asp Arg He Tyr Arg Phe 

50 55 60 

Asp Thr Val He Ala Arg He Glu Ala Leu Ser He Glu Arg Leu Gin 
65 70 75 80 

Lys Leu Leu Leu Glu Lys Asp Arg Leu Lys Phe Cys He Ala Ser Met 

85 90 95 

Tyr Arg Lys Asn Asn Asp Asp Ser Gly Ala Glu He Lys Ala Gly Asn 

100 105 no 

Gly Thr Val Asp Met Ser Val Leu Pro Asp Ala Arg Tyr Ser Ala Leu 

115 120 125 

Leu Gin Glu Asp Phe Ala Tyr Ser Gly Phe Glu Cys Trp Val Glu Asn 

130 135 140 

Glu Asp Gin He Gin Glu Pro His Ser Cys His Gly Ser Glu Gly Pro 
145 150 155 160 

Gly Asn Arg Pro Arg Arg Cys Arg Gly Cys Ala Ala Leu Arg Val Ala 

165 170 175 

Asp Ser Asp Tyr Glu Ala He Cys Lys Val Pro Arg Lys Val Ala Arg 

180 185 190 

Ser He Ser Cys Gly Pro Ser Ser Arg Trp Ser Thr Ser He Cys Thr 
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195 

Glu Glu Pro 

210 
Lys Val Pro 
225 

Ser Val Glu 



Lys Asp Glu 

Cys Cys Ser 
275 

Glu Leu Ala 

290 
Ser Pro Arg 
305 

Gly Ala Lys 

Leu Asn Arg 

Leu Glu Leu 
355 

Tyr Leu Pro 

370 
Gin Lys Leu 
385 

Asp Ser His 

Thr Asn Lys 

Lys Cys Ala 
435 

Leu Lys Glu 

450 
Gin Val He 
465 

Leu His Gin 

Pro Ala Gin 

Cys Ser Gin 
515 

Glu Arg Gin 

530 
Ala Ala His 
545 

Gin Glu Leu 

Lys Ser Gin 

Arg Thr Gin 
595 

Lys Glu Gin 

610 
Asn Ser Asp 
625 

Arg Gin Arg 

Glu Lys Phe 

Arg Leu Ala 
675 

Val Leu Ser 
690 



200 205 
Ala Leu Ser. Glu Val Gly Pro Pro Asp Leu Ala Ser Thr 

215 220 
Pro Asp Gly Glu Ser Met Glu Glu Glu Thr Pro Gly Ser 
230 235 240 

Ser Leu Asp Ala Ser Val Gin Ala Ser Pro Pro Gin Gin 

250 255 
Glu Thr Glu Arg Ser Ala Lys Glu Leu Gly Lys Cys Asp 

265 270 
Asp Asp Gin Ala Pro Gin His Gly Cys Asn His Lys Leu 

280 285 
Leu Ser Met He Lys Gly Leu Asp Tyr Lys Pro He Gin 

295 300 
Gly Ser Arg Leu Pro He Pro Val Lys Ser Ser Leu Pro 
310 315 320 

Pro Gly Pro Ser Met Thr Asp Gly Val Ser Ser Gly Phe 

325 330 335 

Ser Leu Lys Pro Leu Tyr Lys Thr Pro Val Ser Tyr Pro 
340 345 35 0 

Ser Asp Leu Gin Glu Leu Trp Asp Asp Leu Cys Glu Asp 

360 365 
Leu Arg Val Gin Pro Met Thr Glu Glu Leu Leu Lys Gin 

375 3 8 o 
Asn Ser His Glu Thr Thr He Thr Gin Gin Ser Val Ser 
390 395 400 

Leu Ala Glu Leu Gin Glu Lys He Gin Gin Thr Glu Ala 

405 410 415 

He Leu Gin Glu Lys Leu Asn Glu Met Ser Tyr Glu Leu 
420 425 43Q 

Gin Glu Ser Ser Gin Lys Gin Asp Gly Thr He Gin Asn 

440 445 
Thr Leu Lys Ser Arg Glu Arg Glu Thr Glu Glu Leu Tyr 

455 460 
Glu Gly Gin Asn Asp Thr Met Ala Lys Leu Arg Glu Met 
470 475 4eo 

Ser Gin Leu Gly Gin Leu His Ser Ser Glu Gly Thr Ser 

485 490 J 495 

Gin Gin Val Ala Leu Leu Asp Leu Gin Ser Ala Leu Phe 
500 505 510 

Leu Glu He Gin Lys Leu Gin Arg Val Val Arg Gin Lvs 

520 525 
Leu Ala Asp Ala Lys Gin Cys Val Gin Phe Val Glu Ala 

535 540 
Glu Ser Glu Gin Gin Lys Glu Ala Ser Trp Lys His Asn 
550 555 560 

Arg Lys Ala Leu Gin Gin Leu Gin Glu Glu Leu Gin Asn 

565 570 5?5 

Gin Leu Arg Ala Trp Glu Ala Glu Lys Tyr Asn Glu He 
580 585 59 0 

Glu Gin Asn He Gin His Leu Asn His Ser Leu Ser His 

600 € 05 
Leu Leu Gin Glu Phe Arg Glu Leu Leu Gin Tyr Arg Asp 

615 620 
Lys Thr Leu Glu Ala Asn Glu Met Leu Leu Glu Lys Leu 
630 635 640 

He His Asp Lys Ala Val Ala Leu Glu Arg Ala He Aso 

645 . 650 655 

Ser Ala Leu Glu Glu Lys Glu Lys Glu Leu Arg Gin Leu 
660 665 €70 

Val Arg Glu Arg Asp His Asp Leu Glu Arg Leu Arg Asp 

680 685 
Ser Asn Glu Ala Thr Met Gin Ser Met Glu Ser Leu Leu 
695 700. 
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Arg Ala Lys Gly Leu Glu Val Glu Gin Leu Ser Thr Thr Cys Gin Asn 

, , 710 715 720 

Leu Gin Trp Leu Lys Glu Glu Met Glu Thr Lys Phe Ser Arg Trp Gin 

725 730 735 

Lys Glu Gin Glu Ser He He Gin Gin Leu Gin Thr Ser Leu His Asn 

740 745 750 

Arg Asn Lys Glu Val Glu Asp Leu Ser Ala Thr Leu Leu Cys Lys Leu 

755 760 7 6 5 

Gly Pro Gly Gin Ser Glu He Ala Glu Glu Leu Cys Gin Arg Leu Gin 

770 775 7bo 

Arg Lys Glu Arg Met Leu Gin Asp Leu Leu Ser Asp Arg Asn Lys Gin 
790 7 95 goo 

Val Leu Glu His Glu Met Glu He Gin Gly Leu Leu Gin Ser Val Ser 

80S 810 815 

Thr Arg Glu Gin Glu Ser Gin Ala Ala Ala Glu Lys Leu Val Gin Ala 

820 825 830 

Leu Met Glu Arg Asn Ser Glu Leu Gin Ala Leu Ai j Gin Tyr Leu Gly 

835 840 845 

Gly Arg Asp Ser Leu Met Ser Gin Ala Pro He Ser Asn Gin Gin Ala 

850 855 860 

Glu Val Thr Pro Thr Gly Arg Leu Gly Lys Gin Thr Asp Gin Gly Ser 
5 870 875 880 

Met Gin He Pro Ser Arg Asp Asp Ser Thr Ser Leu Thr Ala Lys Glu 

885 890 895 

Asp Val Ser He Pro Arg Ser Thr Leu Gly Asp Leu Asp Thr Val Ala 

900 90S 910 

Gly Leu Glu Lys Glu Leu Ser Asn Ala Lys Glu Glu Leu Glu Leu Met 

915 920 925 

Ala Lys Lys Glu Arg Glu Ser Gin Met Glu Leu Ser Ala Leu Gin Ser 

930 935 940 

Met Met Ala Val Gin Glu Glu Glu Leu Gin Val Gin Ala Ala Asp Met 
945 950 955 960 

Glu* Ser Leu Thr Arg Asn He Gin He Lys Glu Asp Leu He Lys Asd 
965 970 975 V 

Leu Gin Met Gin Leu Val Asp Pro Glu Asp He Pro Ala Met Glu Ara 

980 985 990 

Leu Thr Gin Glu Val Leu Leu Leu Arg Glu Lys Val Ala Ser Val Glu 

"5 1000 1005 

Ser ?i?„ Gly Gln Glu Ue Ser G1 * ten Arg Arg Gin Gin Leu Leu Leu 

1010 1015 1020 

Met Leu Glu Gly Leu Val Asp Glu Arg Ser Arg Leu Asn Glu Ala Leu 
1025 1030 1035 1040 

Gin Ala Glu Arg Gin Leu Tyr Ser Ser Leu Val Lys Phe His Ala His 

1045 1050 1055 

Pro Glu Ser Ser Glu Arg Asp Arg Thr Leu Gin Val Glu Leu Glu Gly 

1060 1065 1070 

Ala Gin Val Leu Arg Ser Arg Leu Glu Glu Val Leu Gly Arg Ser Leu 

1° 7 5 1080 1085 

Glu Arg Leu Asn Arg Leu Glu Thr Leu Ala Ala lie Gly Glv Ala Ala 

1090 1095 iioo 

Ala Gly Asp Asp Thr Glu Asp Thr Ser Thr Glu Phe Thr Asp Ser He 

105 H 10 1115 1120 

Glu Glu Glu Ala Ala His His Ser His Gin Gin Leu Val ' Lys Val Ala 

H25 H30 H35 

Leu Glu Lys Ser Leu Ala Thr Val Glu Thr Gin Asn Pro Ser Phe Ser 

U 40 1145 1150 

Pro Pro Ser Pro Met Gly Gly Asp Ser Asn Arg Cys Leu Gin Glu Glu 

II 55 H60 U65 

Met Leu His Leu Arg Ala Glu Phe His Gin His Leu Glu Glu Lvs Aro 

H 7 0 1175 U80 

Lys Ala Glu Glu Glu Leu Lys Glu Leu Lys Ala Gin He Glu Glu Ala 
i* 8 * 1190 1195 1200 

Gly Phe Ser Ser Val Ser His He Arg Asn Thr Met Leu Ser Leu Cys 
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1205 1210 12 i5 

Leu Glu Asn Ala Glu Leu Lys Glu Gin Met Gly Glu Ala Met Ser Asn 

1220 1225 1230 

Gly Trp Glu lie Glu Glu Asp Lys Glu Lys Gly Glu Val Met Val Glu 
^35 1240 1245 

YoL Val Thr LyS Glu Leu Ser Glu Ser s « Gin Ala Glu 
1250 1255 1260 

Phe Arg Lys Leu Gin Gly Lys Leu Lys Asn Ala His Asn lie He Asn 
T 1265 1270 1275 1280 

Leu Leu Lys Glu Gin Leu Val Leu Ser Ser Lys Glu Gly Asn Ser Lvs 

1285 1290 1295 

Leu Thr Pro Glu Leu Leu Val His Leu Thr Ser Thr He Glu Arg He 

13°° 1305 1310 

Asn Thr Glu Leu Val Gly Ser Pro Gly Lys His Gin His Gin Glu Glu 

1315 1320 1325 

Gly Asn Val Thr Val Arg Pro Phe Pro Arg Pro Gin Ser Leu Asp Leu 
1335 1340 

G iL^ a Thr Phe Thr Val Ala His Gln Leu Asp Asn Gin Ser Gin 
i 1350 1355 1360 

Pro Arg Asp Pro Gly Pro Gin Ser Ala Phe Ser Leu Pro Gly Ser Thr 

1365 1370 1375 

Gin His Leu Arg Ser Gin Leu Ser Gin Cys Lys Gin Arg Tyr Gin Asd 

138° 1385 1390 

Leu Gin Glu Lys Leu Leu Leu Ser Glu Ala Thr Val Phe Ala Gin Ala 

1395 1400 1405 

Asn Glu Leu Glu Lys Tyr Arg Val Met Leu Thr Gly Glu Ser Leu Val 

1410 1415 142Q 

Lys Gin Asp Ser Lys Gin He Gin Val Asp Leu Gin Asp Leu Gly Tyr 
1425 1430 1435 144Q 

Glu Thr Cys Gly Arg Ser Glu Asn Glu Ala Glu Arg Glu Glu Thr Thr 

1445 1450 i 4 55 

Ser Pro Glu Cys Glu Glu His Asn Ser Leu Lys Glu Met Val Leu Met 

1460 1465 147Q 

Glu Gly Leu Cys Ser Glu Gin Gly Arg Arg Gly Ser Thr Leu Ala Ser 

1475 148O 1485 

Ser Ser Glu Arg Lys Pro Leu Glu Asn Gin Leu Gly Lys Gin Glu Glu 

1490 1495 1500 

Phe Arg Val Tyr Gly Lys Ser Glu Asn He Leu Val Leu Arg Lys Asp 
:? 05 T „ 1510 1515 1520 

He Lys Asp Leu Lys Ala Gin Leu Gin Asn Ala Asn Lys Val He Gin 

1525 1530 1535 

Asn Leu Lys Ser Arg Val Arg Ser Leu Ser Val Thr Ser Asp Tyr Ser 

1540 1545 1550 

Ser Ser Leu Glu Arg Pro Arg Lys Leu Arg Ala Val Gly Thr Leu Glu 
1555 iseo 1565 

V f?J rt Ser Pr ° His Ser Val Pro ASP Glu Asp Glu Gly Trp Leu Ser 
157 ° 1575 1580 

Asp Gly Thr Gly Ala Phe Tyr Ser Pro Gly Leu Gin Ala Lys Lys Asp 
T 1585 , 159 0 1595 1600 

Leu Glu Ser Leu He Gin Arg Val Ser Gin Leu Glu Ala Gin Leu Pro 

1605 1610 1615 

Lys Asn Gly Leu Glu Glu Lys Leu Ala Glu Glu Leu Arg Ser Ala Ser 

1620 1625 1630 

Trp Pro Gly Lys Tyr Asp Ser Leu He Gin Asp Gin Ala Arg Glu Leu 

1635 1640 1645 

Ser Tyr Leu Arg Gin Lys He Arg Glu Gly Arg Gly He Cys Tvr Leu 

1650 1655 1660 

He Thr Arg His Ala Lys Asp Thr Val Lys Ser Phe Glu Asp Leu Leu 
l 66S 1670 1675 1680 

Arg Ser Asn Asp He Asp Tyr Tyr Leu Gly Gin Ser Phe Arg Glu Gin 

1685 1690 1695 

Leu Ala Gin Gly Ser Gin Leu Thr Glu Arg Leu Thr Ser Lys Leu Ser 
1700 1705 1710 
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Thr Lys Asp His Lys Ser Glu Lys Asp Gin Ala Gly Leu Glu Pro Leu 

1715 1720 1725 

Ala Leu Arg Leu Ser Arg Glu Leu Gin Glu Lys Glu Lys Val lie Glu 

1730 1735 1740 

Val Leu Gin Ala Lys Leu Asp Ala Arg Ser Leu Thr Pro Ser Ser Ser 
1745 1750 1755 1760 

His Ala Leu Ser Asp Ser His Arg Ser Pro Ser Ser Thr Ser Phe Leu 

1765 1770 1775 

Ser Asp Glu Leu Glu Ala Cys Ser Asp Met Asp lie Val Ser Glu Tyr 

1780 1785 1790 

Thr His Tyr Glu Glu Lys Lys Ala Ser Pro Ser His Ser Asp Ser lie 

1795 1800 1805 

His His Ser Ser His Ser Ala Val Leu Ser Ser Lys Pro Ser Ser Thr 

1810 1815 1820 

Ser Ala Ser Gin Gly Ala Lys Ala Glu Ser Asn Ser Asn Pro He Ser 
1825 1830 1835 i 84 o 

Leu Pro Thr Pro Gin Asn Thr Pro ^ys Glu Ala Asn Gin Ala His Ser 

1845 1850 1855 

Gly Phe His Phe His Ser He Pro Lys Leu Ala Ser Leu Pro Gin Ala 

I860 1865 1870 

Pro Leu Pro Ser Ala Pro Ser Ser Phe Leu Pro Phe Ser Pro Thr Gly 

1875 1880 1885 

Pro Leu Leu Leu Gly Cys Cys Glu Thr Pro Val Val Ser Leu Ala Glu 

1890 1895 1900 

Ala Gin Gin Glu Leu Gin Met Leu Gin Lys Gin Leu Gly Glu Ser Ala 
1905 1910 1915 1920 

Ser Thr Val Pro Pro Ala Ser Thr Ala Thr Leu Leu Ser Asn Asp Leu 

1925 1930 1935 

Glu Ala Asp Ser Ser Tyr Tyr Leu Asn Ser Ala Gin Pro His Ser Pro 

1940 1945 1950 

Pro Arg Gly Thr He Glu Leu Gly Arg He Leu Glu Pro Gly Tyr Leu 

1955 I960 1965 

Gly Ser Ser Gly Lys Trp Asp Val Met Arg Pro Gin Lys Gly Ser Val 

1970 1975 1980 

Ser Gly Asp Leu Ser Ser Gly Ser Ser Val Tyr Gin Leu Asn Ser Lys 
1985 1990 1995 2000 

Pro Thr Gly Ala Asp Leu Leu Glu Glu His Leu Gly Glu He Arg Asn 

2005 2010 2015 

Leu Arg Gin Arg Leu Glu Glu Ser He Cys He Asn Asp Arg Leu Arg 

2020 2025 2030 

Glu Gin Leu Glu His Arg Leu Thr Ser Thr Ala Arg Gly Arg Gly Ser 

2035 2040 2045 

Thr Ser Asn Phe Tyr Ser Gin Gly Leu Glu Ser He Pro Gin Leu Cys 

2050 2055 2060 

Asn Glu Asn Arg Val Leu Arg Glu Asp Asn Arg Arg Leu Gin Ala Gin 
2065 2070 2075 2080 

Leu Ser His Val Ser Arg Glu His Ser Gin Glu Thr Glu Ser Leu Arg 

2085 2090 2095 

Glu Ala Leu Leu Ser Ser Arg Ser His Leu Gin Glu Leu Glu Lys Glu 

2100 2105 2110 

Leu Glu His Gin Lys Val Glu Arg Gin Gin Leu Leu Glu Asp Leu Arg 

2115 2120 2125 

Glu Lys Gin Gin Glu Val Leu His Phe Arg Glu Glu Arg Leu Ser Leu 

2130 2135 2140 

Gin Glu Asn Asp Ser Ser Gly Pro Cys Leu Ser Leu Val Arg Leu Gin 
2145 2150 2155 2160 

His Lys Leu Val Leu Leu Gin Gin Gin Cys Glu Glu Lys Gin Gin Leu 

2165 2170 2175 

Phe Glu Ser Leu Gin Ser Glu Leu Gin He Tyr Glu Ala Leu Tyr Gly 

2180 21B5 2190 

Asn Ser Lys Lys Gly Leu Lys Ala Tyr Ser Leu Asp Ala Cys His Gin 

2195 2200 2205 

He Pro Leu Ser Ser Asp Leu Ser His Leu Val Ala Glu Val Arg Ala 
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2210 2215 2220 

Leu Arg Gly Gin Leu Glu Gin Ser He Gin Gly Asn Asn Cys Leu Ara 
22 25 2230 2235 2240 

Leu Gin Leu Gin Gin Gin Leu Glu Ser Gly Ala Gly Lys Ala Ser Leu 

2245 2250 2255 

Ser Pro Ser Ser lie Asn Gin Asn Phe Pro Ala Ser Thr Asp Pro Gly 

2260 2265 2270 

Asn Lys Gin Leu Leu Leu Gin Asp Ser Ala Val Ser Pro Pro Val Arq 

2275 2280 2285 

Asp Val Gly Met Asn Ser Pro Ala Leu Val Phe Pro Ser Ser Ala Ser 

2290 2295 2300 

Ser Thr Pro Gly Ser Glu Thr Pro He He Asn Arg Ala Asn Glv Leu 
2305 2310 2315 2320 

Gly Leu Asp Thr Ser Pro Val Met Lys Thr Pro Pro Lys Leu Glu Gly 

2325 2330 2335 

Asp Ala Thr Asp Gly Ser Phe Ala Asn Lys His Gly Arg His Val He 

2340 2345 2350 

Gly His He Asp Asp Tyr Ser Ala Leu Arg Gin Gin He Ala Glu Gly 

2355 2360 2365 

Lys Leu Leu Val Lys Lys He Val Ser Leu Val Arg Ser Ala Cys Ser 

2370 2375 2380 

Phe Pro Gly Leu Glu Ala Gin Gly Thr Glu Val Leu Gly Ser Lys Gly 
2385 2390 2395 2400 

He His Glu Leu Arg Ser Ser Thr Ser Ala Leu His His Ala Leu Glu 

2405 2410 2415 

Glu Ser Ala Ser Leu Leu Thr Met Phe Trp Arg Ala Ala Leu Pro Ser 

2420 2425 2430 

Thr His He Pro Val Leu Pro Gly Lys Val Gly Glu Ser Thr Glu Arg 

2435 2440 2445 

Glu Leu Leu Glu Leu Arg Thr Lys Val Ser Lys Gin Glu Arg Leu Leu 

2450 2455 2460 

Gin Ser Thr Thr Glu His Leu Lys Asn Ala Asn Gin Gin Lys Glu Ser 
2465 2470 2475 2480 

Met Glu Gin Phe He Val Ser Gin Leu Thr Arg Thr His Asp Val Leu 

2485 2490 2495 

Lys Lys Ala Arg Thr Asn Leu Glu Val Lys Ser Leu Arg Ala Leu Pro 

2500 2505 2510 

Cys Thr Pro Ala Leu 
2515 



<210> 6 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers 



<400> 6 

cggaattcga ggaggcctac cagaaac 

<210> 7 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primers 



<400> 7 

tgagtcgact acgtgtcaag gcaacaatgg tc 
<210> 8 
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<211> 1683 
<212> PRT 
<213> rat 

<400> 8 

Met Met Ala Gin Phe Pro Thr Ala Met Asn Gly Gly Pro Asn Met Trp 

1 5 10 is 

Ala He Thr Ser Glu Glu Arg Thr Lys His Asp Lys Gin Phe Asp Asn 

20 25 30 

Leu Lys Pro Ser Gly Gly Tyr He Thr Gly Asp Gin Ala Arg Thr Phe 

35 40 45 

Phe Leu Gin Ser Gly Leu Pro Ala Pro Val Leu Ala Glu He Trp Ala 

50 55 60 

Leu Ser Asp Leu Asn Lys Asp Gly Lys Met Asp Gin Gin Glu Phe Ser 
65 70 75 80 

He Ala Met Lys Leu He Lys Leu Lys Leu Gin Gly Gin Gin Leu Pro 

35 90 95 

Val Val Leu Pro Pro He Met Lys Gin Pro Pro Met Phe Ser Pro Leu 

100 105 HO 

He Ser Ala Arg Phe Gly Met Gly Ser Met Pro Asn Leu Ser He His 

115 120 125 

Gin Pro Leu Pro Pro Val Ala Pro He Thr Ala Pro Leu Ser Ser Ala 

130 135 140 

Thr Ser Gly Thr Ser He Pro Pro Leu Met Met Pro Ala Pro Leu Val 
145 150 155 160 

Pro Ser Val Ser Thr Ser Ser Leu Pro Asn Gly Thr Ala Ser Leu He 

165 170 175 

Gin Pro Leu Ser He Pro Tyr Ser Ser Ser Thr Leu Pro His Ala Ser 

180 185 190 

Ser Tyr Ser Leu Met Met Gly Gly Phe Gly Gly Ala Ser He Gin Lys 

195 200 205 

Ala Gin Ser Leu He Asp Leu Gly Ser Ser Ser Ser Thr Ser Ser Thr 

210 215 220 

Ala Ser Leu Ser Gly Asn Ser Pro Lys Thr Gly Thr Ser Glu Trp Ala 
225 230 235 240 

Val Pro Gin Pro Ser Arg Leu Lys Tyr Arg Gin Lys Phe Asn Ser Leu 

245 250 255 

Asp Lys Ser Met Ser Gly Tyr Leu Ser Gly Phe Gin Ala Arg Asn Ala 

260 265 270 

Leu Leu Gin Ser Asn Leu Ser Gin Thr Gin Leu Ala Thr lie Trp Thr 

275 280 285 

Leu Ala Asp He Asp Gly Asp Gly Gin Leu Lys Ala Glu Glu Phe He 

290 295 300 

Leu Ala Met His Leu Thr Asp Met Ala Lys Ala Gly Gin Pro Leu Pro 
305 310 315 320 

Leu Thr Leu Pro Pro Glu Leu Val Pro Pro Ser Phe Arg Gly Gly Lys 

325 330 335 

Gin He Asp Ser He Asn Gly Thr Leu Pro Ser Tyr Gin Lys Thr Gin 

340 345 350 

Glu Glu Glu Pro Gin Lys Lys Leu Pro Val Thr Phe Glu Asp Lys Arg 

355 360 365 

Lys Ala Asn Tyr Glu Arg Gly Asn Met Glu Leu Glu Lys Arg Arg Gin 

370 375 380 

Val Leu Met Glu Gin Gin Gin Arg Glu Ala Glu Arg Lys Ala Gin Lys 
385 390 395 400 

Glu Lys Glu Glu Trp Glu Arg Lys Gin Arg Glu Leu Gin Glu Gin Glu 

405 . 410 415 

Trp Lys Lys Gin Leu Glu Leu Glu Lys Arg Leu Glu Lys Gin Arg Glu 

420 425 430 

Leu Glu Arg Gin Arg Glu Glu Glu Arg Arg Lys Glu He Glu Arg Arg 

.435 440 445 

Glu Ala Ala Lys Gin Glu Leu Glu Arg Gin Arg Arg Leu Glu Trp Glu 
450 455 460 
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Arg He Arg Arg Gin Glu Leu Leu Asn Gin Lys Asn Arg Glu Gin Glu 

470 475 
Glu He Val Arg Leu Asn Ser Lys Lys Lys Ser Leu His Leu Glu Leu 

485 490 495 

Glu Ala Val Asn Gly Lys His Gin Gin He Ser Gly Arg Leu Gin Asd 

500 505 sio 

Val Arg lie Arg Lys Gin Thr Gin Lys Thr Glu Leu Glu Val Leu Asd 

515 520 525 

Lys Gin Cys Asp Leu Glu He Met Glu He Lys Gin Leu Gin Gin Glu 
"° 535 540 



Leu Gin Glu Tyr Gin Asn Lys Leu He Tyr Leu Val Pro Glu Lys Gin 
550 555 sen 

Leu Leu Asn Glu Arg He Lys Asn Met Gin Leu Ser Asn Thr Pro Asd 

565 570 575 

Ser Gly He Ser Leu Leu His Lys Lys Ser Ser Glu Lys Glu Glu Leu 

580 585 590 

Cys Gin Arg Leu Lys Glu Gin Leu Asp Ala Leu Glu Lys Glu Thr Ala 

595 60*0 605 

Ser Lys Leu Ser Glu Met Asp Ser Phe Asn Asn Gin Leu Lys Cys Glv 

610 615 620 

Asn Met Asp Asp Ser Val Leu Gin Cys Leu Leu Ser Leu Leu Ser Cvs 
630 635 640 

Leu Asn Asn Leu Phe Leu Leu Leu Lys Glu Leu Arg Glu Ser Tyr Asn 

645 650 
Thr Gin Gin Leu Ala Leu Glu Gin Leu His Lys He Lys Arg As P Lvs 

66 ° 665 670 

Leu Lys Glu Leu Glu Arg Lys Arg Leu Glu Gin He Gin Lys Lys Lvs 

675 680 685 

Leu Glu Asp Glu Ala Ala Arg Lys Ala Lys Gin Gly Lys Glu Asn Leu 

690 , 695 700 

Trp Lys Glu Ser He Arg Lys Glu Glu Glu Glu Lys Gin Lys Arg Leu 
: 5 710 715 720 

Gin Glu Glu Lys Ser Gin Asp Arg Thr Gin Glu Glu Glu Arg Lys Thr 

725 730 735 

Glu Ala Lys Gin Ser Glu Thr Ala Arg Ala Leu Val Asn Tyr Arg Ala 

740 745 750 

Leu Tyr Pro Phe Glu Ala Arg Asn His Asp Glu Met Ser Phe Asn Ser 

755 760 765 

Gly Asp He He Gin Val Asp Glu Lys Thr Val Gly Glu Pro Gly Trp 

770 775 780 

Leu Tyr Gly Ser Phe Gin Gly Lys Phe Gly Trp Phe Pro Cys Asn Tyr 
5 790 795 800 

Val Glu Lys Met Leu Ser Ser Asp Lys Thr Pro Ser Pro Lys Lys Ala 

805 810 815 

Leu Leu Pro Pro Ala Val Ser Leu Ser Ala Thr Ser Ala Ala Pro Gin 
e r 820 825 830 

Pro Leu Cys Ser Asn Gin Pro Ala Pro Val Thr Asp Tyr Gin Asn Val 

835 840 845 

Ser Phe Ser Asn Leu Asn Val Asn Thr Thr Trp Gin Gin Lys Ser Ala 

850 855 860 

Phe Thr Arg Thr Val Ser Pro Gly Ser Val Ser Pro He His Gly Gin 
865 87( > 875 880 



o / o 880 
Gly Gin Ala Val Glu Asn Leu Lys Ala Gin Ala Leu Cys Ser Trp Thr 
885 890 - — 

Lys His Asp 

jwj 910 

Val Leu Glu Gin Gin Glu Asn Trp Trp Phe Gly Glu Val His Glv Glv 

915 QTfl ~~ - 1 * 



ooo 890 895 

Ala Lys Lys Glu Asn His Leu Asn Phe Ser Lys His Asp Val He Thr 
900 905 



915 920 925 

Arg Gly Trp Phe Pro Lys Ser Tyr Val Lys He He Pro Gly Ser Glu 

930 935 940 

Val Lys Arg Gly Glu Pro Glu Ala Leu Tyr Ala Ala Val Asn Lys Lys 
945 t 950 955 * 960 

Pro Thr Ser Thr Ala Tyr Pro Val Gly Glu Glu Tyr He Ala Leu Tyr 
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Ser Tyr Ser Ser vS Glu Pro Gly Asp "2 Thr Phe Thr Glu tty Glu 

0 985 990 

Glu Leu Leu Val Thr Gin Lys Asp Gly Glu Trp Trp Thr Gly Ser lie 

1000 inns 
Gly Glu Arg Thr Gly lie Phe Pro Ser Asa Tyr Val Arg Pro Lys Asp 

AU 1015 1020 

Gin Glu Asn Val Gly Asn Ala Ser Lys Ser Gly Ala Ser Asn Lys Lys 

1035 irun 
Pro Glu lie Ala Gin Val Thr Ser Ala Tyr Ala Ala Ser Gly Ala Glu 
^ , 1045 1050 1055 

Gin Leu Ser Leu Ala Pro Gly Gin Leu lie Leu lie Leu Ly 3 Lys Asn 

1060 1065 !07 0 

Ser Ser Gly Trp Trp Gin Gly Glu Leu Gin Ala Arg Gly Lys Lys Arg 
„ T , 5 10 80 1085 

tnon 7 PhC Pr ° Ma Ser His Val ^ Leu Le " ^y Pro Ser 

I? 90 1095 1100 

Axa Glu Arg Thr Thr Pro Ala Phe His Ala Val Cys Gin Val lie Ala 

t ~. „ , 1110 ins 1120 

Met Tyr Asp Tyr lie Ala Asn Asn Glu Asp Glu Leu Asn Phe Ser Lys 

1125 1130 1135 

Gly Gin Leu lie Asn Val Met Asn Lys Asp Asp Pro Asp Trp Trp Gin 

1140 1145 H50 

Gly Glu lie A S „ G ly Val Thr Gly Leu Phe Pro Ser Asn Tyr Val Lys 

1155 U60 1165 J y 

Met Thr Thr Asp Ser Asp Pro Ser Gin Gin Trp Cys Ala Asp Leu Gin 

AA/U H75 use 

Ala Leu Asp Thr Met Gin Pro Met Glu Arg Lys Arg Gin Gly Tyr lie 
ti „, 1190 1195 -1200 

His Glu Leu He Glu Thr Glu Glu Arg Tyr Met Asp Asp Leu Gin Leu 

1205 1210 1215 

Val He Glu Val Phe Gin Lys Arg Met Ala Glu Ser Gly Phe Leu Thr 

I 220 1225 1230 

Glu Ala Glu Met Ala Leu He Phe Val Asn Trp Lys Glu Leu He Met 
1235 1240 1245 

^?n Thr Ly3 LSU LSU Lys Ma Leu Val Arg Lys Lys Thr Gly 
n , \ 1255 1260 

Gly Glu Lys Met Pro Val Glu Met Met Gly Asp He Leu Ala Ala Glu 

7 1270 i27 5 1280 

Leu Ser His Met Gin Ala Tyr He Arg Phe Cys Ser Cys Gin Leu Asn 

1285 1290 1295 

Gly Ala Ala Leu Leu Gin Gin Lys Thr Asp Glu Asp Ala Asp Phe Lvs 

1300 1305 i 3 J 0 y 

Glu Phe Leu Lys Lys Leu Ala Ser Asp Pro Arg Cys Lys Gly Met Pro 
1315 1320 1325 

^ 1330 SSr ^335 Pr ° Arg 116 Thr Arg T * r Pro 

Leu Leu He Arg Ser lie Leu Glu Asn Thr Pro Gin Asn His Val Asp 

1350 1355 13 go 

His Ser Ser Leu Lys Leu Ala Leu Glu Arg Ala Glu Glu Leu Cys Ser 

1365 1 37 0 1375 

Gin Val Asn Glu Gly Val Arg Glu Lys Glu Asn Ser Asp Arg Leu Glu 
1380 1385 1390 



j.jo3 1390 
Trp He Gin Ala His Val Gin Cys Glu Gly Leu Ala Glu Gin Leu He 

1395 1400 1405 

Phe Asn Ser Leu Thr Asn Cys Leu Gly Pro Arg Lys Leu Leu Tyr Ser 

1415 1420 
Gly Lys Leu Tyr Lys Thr Lys Ser Asn Lys Glu Leu His Gly Phe Leu 
ll\ 1430 1435 1440 

Phe Asn Asp Phe Leu Leu Leu Thr Tyr Leu Val Arg Gin Phe Ala Ala 

1445 1450 1455 

Ser Ser Gly Phe Glu Lys Leu Phe Ser Ser Lys Ser Ser Ala Gin Phe 
1460 1465 1470 
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Lys Met Tyr Lys Thr Pro lie Phe Leu Asn Glu Val Leu Val Lys Leu 
1475 1480 i4 8 5 1 

PX ° 1490^ l495 GlU PhS HiS Ile Ser His Ile 

Asp Arg Val Tyr Thr Leu Arg Thr Asp Asn Ile A^Glu Arg Thr Ala 
Z 1510 i 515 , 52 

Trp Val Gin Lys lie Lys Ala Ala Ser Glu Gin Tyr Ile Asp Thr Glu 

1525 1530 1535 

Lys Lys Lys Arg Glu Lys Ala Tyr Gin Ala Arg Ser Gin Lys Thr Ser 

15 40 1545 
Gly Ile Gly Arg Leu Met Val His Val lie Glu Ala Thr Glu Leu Lys 
1555 1560 i5 6 5 

?£L LyS Pr ° Asn Gly Lys Ser Pro T y r c ys Glu Ile Ser Met 
^ 1570 157S i58o 

Gly Ser Gin Ser Tyr Thr Thr Arg Thr Leu Gin Asp Thr Leu Asn Pro 
1585 1590 1595 1600 

Lys Trp Asn Phe Asn Cys Gin Phe Phe Ile Lys Asp Leu Tyr Gin Asp 

1605 1610 1615 

Val Leu Cys Leu Thr Met Phe Asp Arg Asp Gin Phe Ser Pro Asp Asp 
1620 1625 i 6 3 0 * y 

Phe Leu Gly Arg Thr Glu Val Pro Val Ala Lys Ile Arg Thr Glu Gin 

1635 1640 1645 

Glu Ser Lys Gly Pro Thr Thr Arg Arg Leu Leu Leu His Glu Val Pro 

iSSO 1655 i 66 o 

Thr Gly Glu Val Trp Val Arg Phe Asp Leu Gin Leu Phe Glu Gin Lys 

1665 1670 1675 1680 

Thr Leu Leu 1680 
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